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Modified Virus 

The present invention relates to novel recombinant 
• 5 viruses suitable for use in gene therapy. Such 

recombinant viruses exhibit an altered tropism conferred 
by incorporation of one or more non-native polypeptides 
into one or more viral components, or the replacement of 
such components by non-native polypeptides. These non- 
10 native polypeptides can comprise elements which 

mimic the structure of the original viral component so 
as to permit inclusion of the polypeptide in functional 
virion particles. These non-native polypeptides can 
also comprise elements which confer a non-native ligand 
15 binding function (i.e. altered tropism) to such virion 
particles. Essentially, the non-native polypeptides 
employed in the invention have primary and secondary 
amino acid structures that enable their correct folding 
in the nucleus or cytosol of mammalian host cells and 
20 their subsequent transport through the nuclear membrane. 
Such structures permit the assembly of functional 
recombinant virion particles with an intact non-native 
binding function and thus an altered tropism. 

25 Clinical gene therapy was introduced in 1989. The aim 
at that time was to correct gene defects in the immune 
system via the in vitro introduction of a healthy gene 
into the defective cells of the patient and transfusion 
of the treated cells back to the patient. Since that 

30 time, the indications and possible molecular uses of 
gene therapy have increased dramatically. Today, ten 
years after its introduction, one can envisage the use 
of gene therapy to treat e.g. vascular diseases, cancer, 
inflammatory diseases and infectious diseases such as 

35 HIV. 

At present, however, gene therapy is still not a useful 
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method in human medicine. One main reason is that gene 
therapy demands the packaging of the genes to be 
delivered into gene-carriers, or vectors, which can be 
administered to patients and which will target the genes 
5 only to the intended cells. Such vectors have so far 
not been available in reliable form. 

Several classes of viruses have been considered as 
vectors for gene therapy applications, the most commonly 
10 used being adenoviruses, retroviruses, lentiviruses and 
adeno-associat'ed viruses. 

Ideal vectors for gene therapy would be those which can 
be administered systemically and yet deliver the desired 

15 genetic material specifically to desired cells or 

tissues. However, the viruses currently considered for 
human gene therapy applications have a broad tropism, 
being able to infect many different types of cells in 
the human body. This limits the potential safety of 

20 viral vectors and prohibits their use for systemic 

administration. For this reason, the development of 
targeted virus vectors, capable of infecting only 
selected cells, has been described as the holy grail of 
gene therapy. 

25 

One of the most widely investigated group of vectors for 
gene therapy applications is the adenoviruses. Human 
adenoviruses are divided into six hemagglutination 
groups (A-F) and each hemagglutination group is further 
30 subdivided into different serotypes. All in all there 

exist more than 40 different human adenovirus serotypes. 
The adenovirus which has been most frequently used for 
human gene therapy is adenovirus type 5 (Ad5) which 
belongs to hemagglutination group C. 

35 

Adenoviruses (Ad) are DNA viruses without an envelope, 
shaped as regular icosahedrons with a diameter of 60-85 
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nm. 

The adenoviral capsid comprises 252 capsomeres, 240 
hexons and 12 pentons (Ginsberg et al., Virology, 28, 
5 782-83 (1966)). The hexons and pentons are derived from 
three viral proteins. The hexon comprises three 
identical proteins of 967 amino acids each, namely 
polypeptide II. The penton contains a base, which is 
bound to the capsid, and a fiber, which is non- 
10 covalently bound to, and projects from, the penton base. 
Proteins IX, yl and Ilia also are present in the 
adenoviral coat and are thought to stabilise the viral 
capsid . 

15 Cell binding takes place through the fiber proteins, 
anchored to the virion at the vertices of the 
icosahedron. The fiber protein is not necessary for 
assembly and release of intact virions. Assembly of 
virions take place in the nucleus of infected cells. 

20 

The fiber protein, which is a homotrimer of a fiber 
polypeptide (namely adenoviral polypeptide IV), contains 
three functionally different parts: an N-terminal tail 
anchoring the fiber non-covalently to the penton base in 

25 the virion and which furthermore contains the 

nuclear-localization signal; a shaft domain comprising a 
variable number of repeats of a -15 amino acid fiber 
shaft motif, (e.g. which is repeated six times in Ad3 
and 22 times in Ad2 and Ad5) ; and a Oterminal globular 

30 domain, the knob, which contains the ligand which binds 
to the cellular Ad-receptor (Chrobozek et al., 
Microbiology and Immunology, (1995), p. 163-200). The 
knob is also functionally responsible for fiber 
trimerisation (i.e. it incorporates a trimerisation 

35 motif) . 

Each shaft repeat has two three-amino acid regions which 
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form beta-sheets and two amino acid regions which 
constitute the turns of the native extended fiber shaft. 
The crystal structure of the trimerised, cell binding 
domain has been determined and shows a unique topology 
5 different from other anti-parallel p-sandwiches (Xia et 
al., Structure 2: 1259-1270, (1993)). Binding of the 
fiber to the penton base of the virion can take place 
also in a cell-free system, i.e. the fiber can bind to 
fiberless virions (Boudin et al., Virology, 116: 589- 
10 604, (1982)). 

Efforts to modify adenovirus fibers (e.g. produce 
recombinant fiber proteins) in order to modify the 
properties of adenovirus vectors, for example by 
15 altering tropism or cell binding, have been made and 
have been reported in the prior art. 

The adenovirus fiber protein performs several biological 
. functions which must be retained in order to produce 
20 active virus particles. The following fiber features 
are deemed to be of key importance in the construction 
of functional modified fiber proteins: 

i) the ability to form parallel homotrimers. This 
function is mediated by the N-terminal amino acid 

25 sequence of the wild type fiber knob. 

ii) the ability to bind to the penton base to form 
penton capsomeres. This function is mediated by the 
wild type fiber tail. 

iii) the ability to express a cell binding ligand 

30 allowing for attachment to target cells. In the native 
situation, this function is mediated by the wild type 
fiber knob (which binds to the cellular Ad-receptor) . 

iv) the capability of transport into the nucleus of 
infected cells, v/hich is vital to virus formation. This 

35 function is mainly, but perhaps not exclusively, 
mediated by the wild type fiber tail. 
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Previous attempts to change the tropism of adenovirus 
have involved genetic modification of fibers and knobs. 
However, this approach has not proved to be very 
successful. A major problem has been the incorporation 
5 of novel ligands in a functional context, which are 

capable of changing the tropism without interfering with 
the trimerisation of the fiber. For example, a short 
peptide ligand has been added C-terminally of the knob 
(Michael et al., Gene Therapy, 2: 660-8, (1995)) and a 

10 nonapeptide has been introduced into one of the knob 
"loops" (Dmitriev et al., J. Virol., 72 (12) : 9706-9713 
(1998)). However, the knob has a very complex structure 
due to interactions between the three fiber subunits 
which are necessary to conserve cell binding and 

15 trimerisation. Therefore, the knob only tolerates 

insertion of a few amino acids and no general method for 
construction of functional, genetically re-targeted 
adenovirus fibers exists to date. 

20 Attempts have also been made to introduce new ligands 

into other parts of the adenovirion. By introducing the 
FLAG tetra-amino acid motif into the Ad penton, it has 
been shown possible to target Ad to cells normally not 
infected by Ad. The re-targeting was achieved by 

25 targeting with a bi-specific antibody where one 

specificity was directed against the FLAG peptide and 
the other against the new target cell (Wickham et al., 
J. Virol., 70: 6831-6838, (1996)). 

30 A previously unaddressed problem encountered in the 
production of efficient recombinant viruses for gene 
therapy, is that of ensuring the functional folding of 
recombinant components upon expression in the nucleus 
and cytosol of host cells. This is particularly 

35 relevant where the wild-type virus to be engineered 
employs component expression in these intracellular 
locations during replication, for example, as in 
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adenovirus. Folded protein structures, particularly 
those which rely on disulphide bonds in cysteine bridges 
to maintain a functionally correct conformation, for 
example those derived from antibodies, can be rendered 
5 mis-folded and non-functional when expressed in the 

reducing environment of the nucleus and cytosol as part 
of a recombinant viral component. Thus, there exists in 
the art an unsatisfied and previously unappreciated need 
for protein structures which can retain a functional 
10 conformation when expressed as viral components in these 
cellular compartments, such that the binding functions 
which the structures comprise or support retain a 
binding function, particularly in the absence of the 
ligand for which binding is desired. 

15 

There are therefore major problems associated with the 
genetic engineering of Adenovirus fibers useful in the 
construction of recombinant re-targeted adenovirus 
(Ad-virus) for human gene therapy. These problems are 

20 deemed to be particularly important since more patients 
have been treated with adenovirus vectors than with any 
other type of vector (Trapnell et al . , Biotechnology, 5: 
617-625, (1994)). The present invention is directed 
towards circumventing such problems in the construction 

25 of genetically re-targeted viruses for gene therapy, 

where the new viral tropism has been accomplished by the 
introduction of a new cell binding ligand into a viral 
component protein. 

30 The invention described herein relates in part to the 
production of functional recombinant adenoviral fiber 
proteins with a new tropism facilitated by removing or 
ablating (e.g. blocking or inactivating) the native cell 
binding domain and either replacing it with, or adding, 

35 a non-native polypeptide comprising an external cell 
binding ligand and an external trimerisation motif. 
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Curiel et al. in WO 99/41359, Wickham et al. in WO 
98/54346 and Spooner and Epenetos in US 5,885,808 
disclose various modifications to adenoviral components 
with the aim of producing recombinant viruses with an 
5 altered tropism. None of the these disclosures provide 
a solution to, or indeed address or even recognise, the 
problems associated with the expression of functional 
non-native viral components in the nucleus and cytosol 
of host cells, a problem solved by the present 
10 invention. 

Thus, in one aspect the invention provides a modified 
virus comprising a non-native polypeptide, which 
polypeptide comprises one or more framework moieties 

15 each containing one or more binding moieties, which 
polypeptide is capable of being expressed in the 
cytoplasm and nucleus of a mammalian host cell in a 
conformation which is maintained in the absence of a 
ligand for said binding moieties, said conformation 

20 allowing said binding moieties subsequently to bind with 
said ligand, and which polypeptide is capable of 
transport though the nuclear membrane, wherein said 
modified virus has an altered tropism conferred by said 
binding moieties. 

25 

A "modified" virus according to the invention is thus a 
virus which differs to a native (i.e. wild-type) virus. 
Generally speaking, the virus is modified such that a 
component of the virus is altered structurally over a 

30 native or wild-type (i.e. unmodified) component, or such 
that a structural component or feature is added to the 
virus, which is not present in the native or wild-type 
form. Advantageously, according to the present 
invention, a property or behaviour of the component is 

35 altered. In other words, the modified virus, differs 

functionally over an unmodified (native/wild-type) virus 
(e.g. by exhibiting an altered tropism) . As discussed 
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above (and also further below), the virus is 
conveniently modified using genetic engineering 
techniques. Accordingly, a modified virus according to 
the present invention is advantageously a recombinant 
5 virus. 

A modified virus according to the invention can be 
derived from any virus, and in particular any virus 
which may be used as the basis of a viral vector for 

10 gene therapy. Representative viral families include 
adenoviruses, .retroviruses, lentiviruses and adeno- 
associated viruses and particularly include members of 
the family Adenoviridae or other virus families where 
viral structural components are synthesized and/or 

15 assembled in the nucleus or cytosol of the host cell, 
such as Reoviridae, Picornaviridae, Parvoviridae, 
Papovaviridae and Caliciviridae . In a preferred aspect, 
modified viruses of the invention are modified forms of 
adenoviruses, in particular Human adenoviruses and more 

20 particularly Human adenovirus type 5. 

"Non-native polypeptide" as defined herein is a 
polypeptide sequence of two or more amino acids, the 
complete sequence of which is not found in a 
25 functionally equivalent position in the amino acid 

sequence of the wild-type virus, or more particularly in 
the wild-type viral component protein to be engineered. 

In certain preferred embodiments of the invention, the 
30 "non-native polypeptide" may also be non-native in the 
sense of not occurring in nature, i.e. being a 
synthetically or artificially constructed or prepared 
polypeptide. 

35 According to the present invention, the conformation of 
the non-native polypeptide after expression in the 
intracellular environment and, more importantly, also 
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its conformation in the extracellular environment, 
should be such that it is capable of binding to a 
desired ligand, e.g. a receptor or other molecule 
expressed on the outer surface of a target cell for the 
5 modified virus. 

As mentioned above, advantageously according to the 
present invention, the non-native polypeptide, when 
expressed in the cytoplasm of a mammalian cell has a 

10 conformation which may be maintained in the absence of a 
ligand for the' binding moiety(ies) of the polypeptide. 
Such a conformation is thus a stable conformation. In 
other words, upon expression in the cytoplasm, the 
polypeptide assumes a conformation which is maintained 

15 in the cytoplasm, and when the polypeptide is 

transported into the nucleus and incorporated or 
assembled into a viral particle. Moreover, the 
conformation is such that the binding moiety or moieties 
of the polypeptide is or are able to bind to their 

20 ligand, when exposed to it. 

Thus, an important feature of the non-native polypeptide 
of the invention is that it is able to fold correctly in 
the cytoplasm or cytosol of a mammalian cell i.e. assume 

25 a tertiary or three-dimensional structure which permits 
the functionality of the binding moiety (ies) to be 
retained (i.e. that the binding moieties retain their 
binding activity towards their ligand) . In other words, 
the binding moiety remains capable of functional binding 

30 to its ligand. 

The solubility of recombinant proteins is considered a 
good indicator of their correct folding. Therefore non- 
native polypeptides of the invention may advantageously 
35 be chosen not only on the functionality of binding 

moieties, stability of conformation and ability to form 
part of an assembled virion, but also on the basis of 
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solubility within cells in which a non-native 
polypeptide , or a viral component protein containing or 
comprising a non-native polypeptide, are expressed. For 
example, recombinant fiber proteins may be tested for 
5 their total expression in eukaryotic (e.g. insect) cells 
and the proportion recovered in the soluble fraction of 
the cell lysates determined. 

The Inventors have found that phenotypic analysis of 
10 fiber-ligand fusion constructs expressed as recombinant 
proteins in ba/culovirus- infected insect cells showed 
that their degree of solubility correlated with the 
recovery of viable recombinant adenovirus. Therefore, it 
is preferred that any non-native polypeptide of the 
15 invention, or a viral component protein containing or 
comprising a non-native polypeptide, for example a 
fiber-ligand fusion construct, is characterised with 
regard to its solubility and target cell attachment as 
recombinant protein, prior to the re-insertion of the 
2 0 corresponding gene into the viral, for example 
adenovirus genome. 

Thus the invention also provides a modified virus 
wherein said non-native polypeptide is selected using a 

25 solubility assay of the non-native polypeptide or the 
viral component protein comprising the non-native 
polypeptide. The non-native polypeptide is selected such 
that greater than 25%, preferably greater than 30%, more 
preferably greater than 50% or even 70%, of the 

30 non-native polypeptide or the viral component protein 

comprising the non-native polypeptide is present in the 
soluble fraction of cell lysates of cells expressing the 
non-native polypeptide or viral component protein 
comprising the non-native polypeptide. A suitable assay 

35 is described in the examples herein which may be applied 
mutatis mutandis to other cell expression systems. 
Reference herein to a non-native polypeptide or a viral 
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component comprising said non-native polypeptide being 
soluble in the cellular environment should be 
interpreted in line with these limits, i.e. when at 
least 25-30% of the polypeptide is present in the 
5 soluble fraction of a cell lysate, the polypeptide can 
be considered 'soluble 1 . 

Thus, according to a further aspect, the present 
invention provides a method of determining the 

10 suitability of a non-native polypeptide (e.g. a 
recombinant vital fusion protein) for use in the 
preparation of a viral vector by determining its 
solubility in a cell system. More specifically is 
provided a method of assaying the solubility of a 

15 non-native polypeptide or a modified viral protein 

component of the invention, comprising the steps of i) 
expressing said non-native polypeptide or a viral 
component protein comprising said non-native polypeptide 
in permissive cells; ii) subjecting the cells to lysis 

2 0 to produce a cell lysate; iii) separating the soluble 
and insoluble fractions of the cell lysate; iv) 
analysing the soluble and insoluble fractions of the 
cell lysate for the content of said non-native 
polypeptide or viral component protein comprising said 

25 non-native polypeptide; and, v) comparing the relative 
content of said non-native polypeptide or viral 
component protein comprising said non-native polypeptide 
in the soluble and insoluble fractions to determine the 
solubility. 

30 

A ligand for a binding moiety of the invention may be 
regarded as the ligand which corresponds to the binding 
moiety, or in other words, the ligand to which the 
binding moiety was designed or selected to bind. The 
35 ligand is thus capable of binding to the binding moiety. 
The ligand and its binding moiety may thus be regarded 
as members of an affinity-binding pair, the ligand being 
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a binding partner for the binding moiety. 

The binding moiety thus has a binding specificity for, 
or is capable of binding specifically to, a desired 
5 ligand. By "binding specificity" or "binding 

specifically" it is meant that the binding moiety is 
capable of binding to the desired (or "target") ligand 
in a manner which is distinguished from the binding to 
non-target molecules or ligands. Thus, the binding 
10 moiety either does not bind to non-target molecules or 
exhibits negligible or substantially-reduced (as 
compared to the target ligand) e.g. background, binding 
to non-target molecules. The binding moiety thus 
specifically recognises the target ligand. 

This specificity of binding of the binding moiety thus 
permits the modified virus to be selectively targeted. 
In other words, the binding moiety may be designed or 
selected to enable the modified virus to bind to a 
20 desired or "target" cell. A binding moiety may be 

designed or selected which binds to a ligand expressed 
by or on a target cell, e.g. on the cell surface. 

The ligand may thus be any desired ligand, and 
25 advantageously will be a molecule expressed on the 
surface of a target cell, in which it is desired to 
achieve expression of the modified virus. The ligand 
may thus be a cell surface receptor, or a cell-surface 
antigen. The ligand may conveniently be a protein or 
30 polypeptide molecule, but may be of any molecular 
nature, for example a lipid or carbohydrate. 

In this manner, the modified virus may be modified to 
bind to a desired target cell to which the native (wild- 
35 type) virus from which it is derived does not bind, or 
it may be modified to have a more restricted binding 
specificity than the wild-type virus, in other words, to 
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bind to only a selected or particular sub-set or type of 
target cell from among a broader population of target 
cell types to which the wild-type virus binds. The 
tropism of the virus is thus altered. Hence, by 
"altered tropism" it is meant that the modified virus 
exhibits a target cell binding specificity which is 
altered, or different, to that of the wild-type virus 
from which it is derived. 

Generally, a non-native polypeptide according to the 
invention comprises at least one framework moiety and 
one or more binding moieties and is capable of 
interacting with other viral components to form a 
functional and infective virion particle. The ability 
to form part of a functional virion particle can be 
facilitated by the presence of one or more sequences 
within the non-native polypeptide that effect binding to 
other viral components by the non-native polypeptide 
itself, or a viral component in which the non-native 
polypeptide is comprised, in the virion assembly 
process . 

A non-native polypeptide of the invention may replace or 
be incorporated into any viral component protein which 
is capable of interacting with a target cell. The 
modified (e.g. recombinant) viral component possesses a 
cell binding function either by the nature of a retained 
native structure of the viral component itself or, by a 
new structure conferred upon it by the incorporation of 
the non-native polypeptide and any structures comprised 
therein. In a preferred aspect of the invention, the 
non-native polypeptide is introduced or incorporated 
into or forms a fusion protein with, a viral protein 
component of the wild type virus, for example, an 
adenoviral fiber protein, and especially preferably it 
is incorporated such that the wild-type fiber knob (or 
at least the cell-binding domain thereof) is removed 
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(i.e. replaced) . 

However, in an alternative embodiment, a wild-type fiber 
knob (or cell-binding domain thereof) may be retained, 
and a further, or additional, cell-binding domain may be 
5 added by virtue of the binding moiety (ies) of the non- 
native polypeptide. It is a feature of the invention 
that the cell binding functions are altered from those 
of the wild type virus to be engineered, such that an 
altered viral tropism is present in the modified virus. 
10 Where the wild-type fiber knob (or cell-binding domain 
thereof) is retained, this altered tropism may be 
achieved (or facilitated) by modifying the wild-type 
knob/cell-binding domain thereof to inactivate or block 
the native or wild-type cell binding function. 

15 

As will be discussed further below, it may in certain 
circumstances be desirable to construct or engineer a 
modified virus in which control (e.g. temporal control) 
may be exhibited on the expression of the altered 

20 tropism. Thus, in such a modified virus, a native or 
wild-type tropism may be retained, by retaining the 
wild-type fiber knob or cell-binding domain thereof, in 
addition to providing the non-native polypeptide which 
confers altered tropism. For example, this is desirable 

25 for the propagation of a modified virus in conventional 
cell lines known in the art via the additional presence 
in the modified virus of a wild type binding function, 
for example through the controlled expression of a wild 
type adenoviral fiber gene from an inducible control 

30 element (e.g. promoter). 

The wild-type tropism may be ablated when desired by 
controlling (e.g. preventing) expression of the 
additional wild type viral component . genes, or after 
35 expression, by removing. or inactivating the wild type 
knob/domain, in such a manner that altered tropism 
conferred by the non-native polypeptide may be 
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expressed. An "altered tropism" according to the 
invention thus includes a "potential" altered tropism, 
i.e. the potential to express an altered tropism. It 
also includes an altered tropism which is additional to 
5 a wild-type tropism. 

As mentioned above, the modified virus of the invention 
is preferably prepared using genetic engineering 
techniques and in preferred embodiments of the 

10 invention, the non-native polypeptide is provided as 

part of a fusi'on protein with a viral protein component, 
preferably as a fusion protein with an adenoviral fiber 
protein. Such a fusion protein, or more generally, such 
a modified viral component protein, represents a 

15 separate aspect of the present invention. In a 

preferred embodiment of this aspect of the invention, 
the modified viral component is a modified adenoviral 
fiber protein comprising a non-native polypeptide as 
defined above. 

20 

Techniques for preparing such non-native polypeptides 
and introducing them into viruses or viral components 
are well known in the art and widely described in the 
literature. Thus, for example, molecular biology or 

25 genetic engineering techniques are readily available, to 
prepare or construct genetic sequences capable of being 
expressed as a modified virus, or viral component, 
according to the present invention. As described 
further in the Examples below, a nucleic acid molecule 

30 or nucleotide sequence encoding a viral component 

protein, such as the adenoviral fiber protein, may be 
modified so as to introduce a nucleotide sequence 
encoding the non-native polypeptide, for example so as 
to encode a fusion protein comprising all or part of an 

35 adenoviral fiber protein and the non-native polypeptide. 

Depending upon the viral component protein into which 
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the non-native polypeptide of the invention is 
incorporated, or which component (s) it replaces, a non- 
native polypeptide according to the invention can 
optionally comprise a further element which mimics the 

5 native structure or function of the viral component 

(i.e. is a functional equivalent of the viral component) 
so as to facilitate the assembly of functional virion 
particles. In the case of modified adenoviral fibers 
according to the invention, the functional integrity of 

.0 the adenoviral fiber with regard to virion assembly, and 
particularly q'apsid assembly, is maintained by the 
presence of external amino acid trimerisation motifs 
such as the helical amino acid motif derived from the 
neck region of human lung surfactant protein D (Hoppe et 

5 al., FEBS Letters, 344: 191-195 (1994)). This and other 
trimerisation motifs known in the art can be 
functionally engineered into the fiber shaft and act as 
a fiber trimerisation signal to create knob-less fibers. 
For example, trimerisation motifs suitable for inclusion 

0 in modified viruses are described in WO 98/54346 and WO 
99/41359. In a preferred embodiment of the invention, 
the non-native trimerisation motif present in the non- 
native polypeptide is the neck region peptide from human 
lung surfactant D. 

5 

As described in the Examples below, it may be convenient 
or necessary for any such additional or external motif 
or feature to be incorporated into the virus by means of 
a "linker" sequence. Such construction techniques for 
0 incorporation of DNA or amino acid sequences, via 

attachment to a linker sequence are known in the art and 
are within the routine skill of a protein/genetic 
engineer . 

5 A "framework moiety 11 as defined herein, is a polypeptide 
(e.g. a protein) structure which retains a functional 
(e.g. folded) structure (or conformation) in the nuclear 
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and cytosolic cellular environments. Such functional 
structures are structures which allow a binding moiety 
attached to, or incorporated within, the framework 
moiety to retain ligand binding conformation in the 
absence of the ligand. This facilitates subsequent 
binding to that ligand, for instance, once the 
reassembled virion has left the cellular environment. 

A framework moiety may thus be regarded as a type of 
"molecular scaffold 11 structure, which provides a 
framework to support or "hold" the binding moiety in an 
appropriate presentation or conformation to permit 
binding to its ligand. The framework moiety also 
provides the intramolecular interactions making a stable 
conformation in the cytosol possible. The framework 
moiety may thus be a protein or polypeptide molecule, 
which assumes a particular conformation or structure, 
and which tolerates modification in a particular region 
or regions, for example modification by amino acid 
sequence addition, insertion, deletion or substitution, 
or indeed insertion of a polypeptide sequence. 

A "binding moiety" as defined herein, is thus a 
polypeptide structure attached to or comprising part of 
a framework moiety of the invention and which retains a 
binding function for a desired (target) ligand after 
expression of the non-native polypeptide of which it is 
part in the nucleus and cytosol of a host cell. 

The binding moiety may be a contiguous or non-contiguous 
sequence of amino acids, and may be viewed as providing 
a binding site for the target ligand. A binding moiety 
may thus be provided by a region of a protein or 
polypeptide molecule, for example a surface region, e.g. 
a number of "surface" amino acid residues. 

In certain preferred embodiments of the invention, the 
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framework and binding moieties may be of different 
origin, e.g. obtained from, or derived, from different 
sources, (e.g. from different proteins), for example by 
"grafting" or linking a desired binding moiety onto or 
5 to a desired framework. In other preferred embodiments, 
a binding moiety may be created within a protein 
"framework" structure, by modifying certain regions or 
residues of the framework protein, as described further 
below. In such an embodiment, the framework moiety may 
10 be represented by the "constant" or unmodified residues, 
and the bindin'g moiety by the "variable" or modified 
residues . 

A non-native polypeptide according to the present 
15 invention may thus be a combinatorial protein, that is a 
protein made by randomisation (random mutagenesis) of a 
particular protein structure, to generate a binding 
protein with novel, modified or enhanced binding 
characteristics. Such synthetically constructed 
20 "artificial" (in the sense of non-native) proteinaceous 
affinity binding molecules (i.e. proteins engineered to 
possess a particular or novel binding function) are 
known generally in the art. 

25 Such combinatorial proteins can be prepared using 

various peptides and proteins as starting structures 
(Nygren and Uhlen, Current Opinion in Structural 
Biology, 7:463-469, 1997). Such proteins are known in 
the art, and may typically be prepared by random 

30 mutagenesis of a target protein, expression of the full 
library of these variants, e.g. on the surface of 
filamentous bacteriophage, followed by selection of a 
protein exhibiting the desired binding characteristics. 
This selection may typically involve a binding reaction 

35 between the variant protein and a target ligand (binding 
partner) , which conveniently may be immobilised, and may 
be carried out in vivo or in vitro. The mutagenesis is 
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random in that the resulting amino acid encoded by any 
particular codon is not generally pre-determined but the 
positions where mutations are to be . introduced are 
generally identified in advance. The mutagenesis may 
5 involve amino acid substitution, deletion, or addition 
(e.g. insertion) . 

Preferred combinatorial proteins for use as non-native 
polypeptides of the invention are proteins known 

10 generally in the art as affibodies. The term "affibody" 
as used herein/ defines an affinity binding molecule 
which is derived from a bacterial receptor protein, or 
binding domain thereof, wherein the binding domain is 
modified (e.g. by protein/genetic engineering) to modify 

15 (e.g. alter or enhance) the binding properties thereof. 

Advantageously, the affibody is a non-native protein (in 
the sense of not occurring in nature) and is further 
preferred to have a novel binding site. Examples and 
further descriptions of such protein molecules are given 

20 in WO 95/19374. 

The use of an expression system such as surface display 
on phage provides a crucial link between genotype and 
phenotype; there is a self-contained unit which can be 

25 selected on the basis of its specific binding 

interactions and which also carries the nucleic acid 
encoding for the protein responsible for the observed 
binding characteristics. This enables expression in 
useful amounts of the protein selected for its binding 

30 characteristics, such expression typically taking place 
in a transformed bacterial host. 

The protein, selected by its ability to bind to a target 
ligand (e.g. a desired cell surface molecule) may then 
35 be used to prepare the modified virus or viral component 
of the present invention, or more particularly a 
nucleotide sequence encoding the desired combinatorial 
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protein may be so used. 

Techniques for construction of a combinatorial library 
of protein molecules and subsequent selection to obtain 
5 proteinaceous binding molecules having desired binding 
characteristics are known in the art (Nygren, P. and 
Uhlen, M. Current Opinion in Structural Biology (1997) 
1_: 463-469) . Generally, a protein molecule, perhaps 
having intrinsic beneficial properties such as 

10 temperature or pH insensitivity , or conformational 

stability, is ./used as a "scaffold" or "framework" and a 
combinatorial library is then constructed via random but 
targeted amino acid substitutions , (or other mutations) 
of that protein molecule, in order to produce a library 

15 of molecules having different binding characteristics. 
Surface residues are generally targeted for random 
mutagenesis . 

In addition to phage display technology (Smith et al., 
20 Meth. Enzym. 217, 228-57, (1993)), other methods for 

library construction and selection include, for example, 
ribosomal display (Hanes et al., Proc. Natl., Acad. Sci. 
USA 94: 4937-4942 (1997)), peptides-on-plasmids (Schatz 
et al., Methods Enzymol., (1996) 267: 83-109), RNA- 
25 protein fusion (Roberts et al., Proc. Natl. Acad. Sci. 
USA 94: 12297-12302 (1997)) and DNA-protein linkage 
(STABLE) (Doi et al . , FEBS Lett, 457(2): 227-30, (1999)). 

Suitable protein frameworks may simply be linear 
30 peptides but preferably the framework will possess a 
folded three dimensional structure which has the 
potential for higher affinities and is less susceptible 
to proteolytic degradation. Although a framework may be 
designed de novo, naturally existing proteins or domains 
35 are usually selected for further engineering. For the 
avoidance of doubt, it is to be noted that throughout 
this specification the word "protein" is used to refer 
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to whole protein molecules as well as domains or 
fragments thereof, polypeptides or peptides. 

The choice of protein framework depends on several 
5 parameters including an ability to be effectively 

expressed in a desired host cell (e.g a mammalian cell). 
The protein should also comprise sufficiently large 
regions on its surface which are tolerant to 
substitution (or insertion or deletion etc.) without 
10 losing the overall three dimensional structure. If the 
library is to be produced synthetically, a small overall 
size is a prerequisite. Where the selected framework 
protein has a binding function, amino acid residues 
involved in that interaction may be a target for 
15 randomisation. Randomisation may be performed in order 
to enhance known binding properties or to develop 
binding molecules with new specificities. 

Suitable framework molecules are discussed in Nygren et 
al. (supra) and include cyclic peptides in a constrained 
sequence (the number of amino acid residues in such a 
constrained sequence is not critical and can be 5 or 
more, e.g. 5 to 10 or more, e.g. 40 or more), 
immunoglobulin-like scaffolds including Fv or single- 
chain (scFv) domains, bacterial receptors such as the 
58-residue one-domain Staphylococcal protein A (SPA) 
analogue Z (the "Z Domain" being a derivative of the B 
domain of SPA) , or other domains or analogues of SPA, 
DNA-binding proteins particularly zinc fingers and 
protease inhibitors . 

Of particular interest is the bacterial receptor domain 
Z. Nord et al., in Protein Engineering 8 ( 6) : 601-608 
(1995), describe a method of constructing a 
35 combinatorial library of protein molecules based on the 
Z domain, which can be applied to a range of framework 
molecules. The method described is solid-phase-assisted 



25 



WO 02/08263 



PCT/G B0 1/03252 



- 22 - 

and based on the stepwise assembly of randomised single- 
stranded oligonucleotides. 



As an alternative to modifying amino acid sequences or 
5 residues within a molecule to create a binding moiety, a 
binding moiety may be introduced to a framework moiety, 
for example by insertion or addition of a polypeptide 
constituting or comprising a binding moiety. The 
binding moiety may thus be attached to a framework 
10 moiety. In such a case, a binding moiety may be 

provided by an! affinity binding partner for the desired 
target ligand. 

Preferred binding moieties of the invention may be 
15 derived from, but are not limited to, ligands (i.e. 

binding partners) for cell surface receptors, anti- 
receptor antibodies (or antibody fragments or 
derivatives) , cell specific peptides, single chain 
antibodies (ScFv), single domain antibodies, and minimal 
20 recognition units of antibodies such as a complementary- 
determining regions (CDRs) of Fv fragments. A binding 
moiety may thus be obtained or derived from the antigen- 
binding site or antigen binding or recognition/region ( s ) 
of an antibody, and such an antibody may be natural or 
25 synthetic. 



Also envisaged within the scope of the invention are 
binding moieties derived from peptides or polypeptides 
in any form, including hormones, antibodies, T cell 
30 receptors, affibodies and ligands identified from 
various protein libraries. 



As mentioned above, an important feature of the non- 
native polypeptide is that conformation is maintained in 
35 the cytoplasm and nucleus of a mammalian cell, such that 
the binding function of the binding moiety is retained. 
As further mentioned above, this is achieved by 



WO 02/08263 



PCT/GB01/03252 



- 23 - 

providing a non-native polypeptide which maintains 
correct folding in the cytoplasm and nucleus of a 
mammalian cell. It has been found that such correct 
folding may be achieved using a non-native polypeptide 
5 which does not rely on disulphide bonding for 

conformation (i.e. which does not contain di-sulphide 
bonds) . As will be discussed in more detail below, a 
further preferred feature of a non-native polypeptide 
according to the present invention is the presence of a 
10 a-helical structure. 

WO 95/19374 describes several framework proteins, which 
share the beneficial feature of not relying on S-S bonds 
for their conformation. Such framework proteins are 

15 useful as framework moieties according to the invention. 
These include domains of bacterial receptors such as 
staphylococcal protein A (aaa type), protein G (IgG 
binding parts, (3pa3P type), protein L (p(3ap(3 type), and 
protein G (HSA binding parts, aaa type) . Framework 

20 moieties according to the invention can also be derived, 
from various bacterial receptors, for example, but not 
limited to those listed in the table 1: 

Table 1: Examples of G+ bacterial receptors 



Receptor [ligand] a Origin 



30 



35 



Fc [IgG] receptor type I 
type II 
type III 
type IV 
type v 
type VI 

Fibronectin receptor + 

M protein 

Plasmin receptor 

Collagen receptor 



Staphylococcus aureus 
Staphylococcus pyogenes [group A] 
Streptococcus group C, G, L 
bovine group G streptococci 
Streptococcus zooepidemicus [group C] 
Streptococcus zooepidemicus S212 
S. aureus, streptococci 
Streptococcus pyogenes [group A] 
Streptococci group A 
S. aureus, streptococci 
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Fibrinogen receptor 
Protein L[ 6 light chains] 
Protein H[ human IgG] 
Protein B [human IgA, Al ] 
Protein Arp [human IgA] 
Serum albumin receptor 



streptococci groups A, C, G 
Peptostreptococcus magnus 
Streptococcus pyogenes (group A] 
Streptococcus a'galactiae [group B] 
streptococci group A 
streptococci groups A, C, G 



Ligand is indicated when not obvious from receptor name 



10 Framework moieties according to the invention are not 

restricted to /structures from bacterial receptors. Also 
useful in the invention are other polypeptides 
comprising a-helical structures, often referred to as 
a-helical coiled coils, which are known from different 

15 sources (Cohen et al., Science 263:488-489, (1994); 

Harbury et al . , Science 262:1401-1407, (1993)). The 
coil making up the framework in these peptides consists 
of repeats of amino acid sequences containing 
characteristic positioned hydrophobic residues. The 

20 structure of a-helical coiled coils is not dependent on 
intra- or intermolecular disulphide bridges for 
stability. Examples of a-helical coiled coils are the 
neck region peptide from human lung surfactant protein 
D, members or the spectrin superfamily, the leucine 

25 zippers and parts of the hemagglutinin in influenza 
virus . 



Particularly preferred as sources of non-native 
polypeptides according to the present invention are 
30 members of the three-helix bundle family (e.g. as 

exemplified by the Z-domain of staphylococcal protein 
A) . 

Thus, in one embodiment, the non-native polypeptide 
35 according to the invention comprises a framework moiety- 
which is based on the structure of a domain derived from 
a bacterial receptor. Preferred structures of the 
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framework moieties of the invention are derived from or 
based upon the alpha-alpha-alpha (aaa) -three-helix 
bundle or the beta-beta-alpha-beta-beta ((3(3a33) 
structure classes. Also preferred are structures based 
5 upon or derived from the a-helical coiled coil family, 
particularly a member of any of the 2-, 3- or 4-helical 
coiled coil families. 

In an embodiment of the invention, it is envisaged that 
10 the binding moiety is present within one or more of the 
helical bundle's and/or one or more of the loops 
connecting these bundles. 

In a further preferred embodiment of the invention, the 
15 framework moiety is based on the structure of a domain 
derived from staphylococcal protein A, streptococcal 
protein G or Peptostreptococcus magnus protein L. 

In a further particularly preferred embodiment of the 
20 invention, the framework moiety is a derivative of the 
immunoglobulin binding Z-domain from staphylococcal 
protein A (Nord et al., supra). 

In such embodiments the binding moiety may be created by 
25 combinatorial protein engineering as discussed above. 
In other embodiments, the domain or protein selected 
(e.g. the Z-domain) may be used in wild-type form as the 
non-native polypeptide of the invention. 

30 As mentioned above, through combinatorial protein 

engineering, e.g. targeted to surface-located residues 
of the Z-domain, libraries may be constructed from which 
novel variants (i.e. novel binding molecules) termed Z-- 
domain affibodies, may be selected by binding the 

35 desired target ligand (Hansson et al., Immunotechnology 
4 : 237-52 (1999) ) . 
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Preferred Z-domain-based non-native polypeptides may 
comprise the following amino acid sequences: 

VDNKFNKEXXXAXXEIXXLPNLNXXQXXAFIXSLXDDPSQSANLLAEAKKLNDAQAPK 
5 [SEQ. ID. NO.: 45]; 

and 

VDNKFNKEXXXAXXEIXXXXXXXXXQXXAFIXSLXDXXXXSANLLAEAKKLNDAQAPK 
10 [SEQ . ID. NO.: 46], 

where X is any amino acid. 

In the above polypeptides, the conserved (i.e. 
15 specified) amino acid residues may be regarded as 

constituting the framework moiety, and the variable 
residues X, as together providing the binding moiety. 

Furthermore, framework moieties according to the 

20 invention need not be dependent on alpha-helices for 

their stability, provided that the required conformation 
for ligand binding by the binding moiety is retained 
after expression of the ' non-native polypeptide in the 
nucleus and cytosol of a host cell. Such frameworks for 

25 example include frameworks derived from certain 

antibodies and their derivatives, known in the art, 
which do not require the presence of disulphide bridges 
to maintain structure and can therefore be expressed as 
part of a recombinant viral component in the nucleus or 

30 cytosol of a host cell, whilst retaining a functional 
conformation and functioning as binding moieties for a 
desired ligand. Other antibody structures within the 
scope of the invention include antibody structures in 
which structurally relevant ' cysteine residues have been 

35 replaced with alanine residues, and which retain the 
binding specificity of the antibody. 
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Additional framework moieties are also envisaged within 
the scope of the invention, for example, sequences 
derived from antibodies (e.g. monoclonal antibodies) and 
antibodies converted into a single-chain format allowing 
5 for the construction of genetic fusions to, or the 

modification of, viral components. In such a case where 
an antibody-derived framework moiety is used, then 
similarly as described above, a binding moiety may be 
created by protein/genetic engineering techniques (e.g. 
10 by modifying certain amino acid residues) or a binding 

moiety may be introduced (e.g. linked or grafted) to the 
antibody framework . 

Antibody fragments conferring a desired binding activity 
15 can be used as binding moieties through the grafting of 
complementarity determining loops or regions (CDRs) of a 
an antibody with a desired tropism into a framework 
moiety capable of productive folding, i.e. capable of 
retaining the binding specif icity ( ies ) of attached or 
20 incorporated binding moieties, in the cytoplasm and 
subsequent transport into the cell nucleus where the 
virus assembly takes place. In other words, a binding 
moiety may be a CDR of an antibody. 

25 Further, according to the invention, certain antibody 

structure frameworks can be used as framework moieties, 
for example in conjunction with antibody-based or 
derived binding moieties e.g. specificity determining 
loops (CDRs), resulting in the directed construction of 

30 antibodies suitable for construction of re-targeted 
viral components. 

An antibody framework may be used as a framework moiety 
(for example to receive CDR loop(s) as binding 
35 moiety (ies)) according to the present invention, 

provided that it is capable of productive folding in the 
cytoplasm and subsequent transport into the cell 
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nucleus. Single chain Fv fragments from monoclonal 
antibodies without disulfide bonds have been produced 
(Proba et al. f J. Mol. Biol., 275: 245-53, (1998)). 
Such ScFv antibodies provided they meet the functional 
5 criterion above, may be used on framework moieties 

according to the present invention . The binding moiety 
may be the binding site of the ScFv antibody itself, or 
it may be further engineered as discussed above. 

10 Intracellular selection of functional antibodies from a 
polyclonal repertoire has also been achieved (Gargano et 
al., FEBS Letters 414; 537-40, (1997)). This selection 
technology may be useful in the identification of 
antibody fragments suitable for genetic retargeting of 

15 adenoviruses, namely suitable for use as framework, 
and/or binding moieties according to the invention. 

Other antibody frameworks which may be used, include 
those based on camel antibodies which are naturally 
20 devoid of light chains, or certain VH regions derived 
from conventional antibodies. 

An example of an antibody framework according to the 
invention that is capable of productive folding in the 

25 cytoplasm, is a particular anti (3-galactosidase single 
chain Fv fragment. This single chain variable chain 
fragment (VK, linker, VH) reactive with (3-galactosidase 
and capable of being expressed in the cytoplasm has been 
previously described (Martineau P and Betton J-M: J. 

30 Mol. Biol. 292, 921-929 (1999). Thus, the invention also 
provides a modified virus comprising a framework moiety 
which comprises a sequence encoding this anti 
(3-galactosidase single chain Fv fragment [SEQ ID. 47.] 
and functionally equivalent variants thereof. 

35 

The invention is not limited to framework moieties based 
upon antibody and affibody (e.g. receptor) structures, 
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but is extended to include other structures provided 
these are capable of retaining the binding specificity 
of attached or incorporated binding moieties when 
expressed in the cytosol/nucleus of infected cells. 

5 

Two or more non-native polypeptides, e.g. having 
different binding specificities, may be present in a 
modified virus of the invention, or a non-native 
polypeptide may incorporate two or more different 
10 binding moieties, or indeed two or more different 
framework/binding moiety constructs. 

Accordingly, the invention provides a modified virus 
which comprises a first non-native polypeptide which 
15 binds a target cell and a second non-native polypeptide 
which binds a production cell or permissive cell. 

Non-native polypeptides according to the invention may 
thus be present in recombinant viral components as 
2 0 comprising bi- or multi functional framework moieties 
(or framework/binding moiety constructs) constructed 
through genetic fusion between two or more different 
framework moieties (framework/binding moiety 
constructs) . Use of such framework moieties (or 

2 5 framework/ binding moiety constructs) can, for example 

confer infectivity of multiple cellular targets to the 
recombinant virus. 

Also provided by the invention is a modified virus in 

3 0 which the non-native polypeptide according to the 

invention comprises a cleavage site positioned in a 
location that enables a binding moiety of the non-native 
polypeptide to be cleaved from the modified virus, for 
example preceding the binding moiety before the distal 
35 end of the fiber relative to the assembled virion. 
Examples of suitable cleavage sites are sites 
susceptible to a Factor Xa enzyme or a protease such as 
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the human rhinovirus 3C protease. If cleavage is carried 
out at the genetic level, for example, within a nucleic 
acid molecule comprising a part or all of a recombinant 
adenoviral genome, a cleavage site susceptible to an 
5 appropriate restriction enzyme or ribozyme may be used, 
for example in a particular cell population in which the 
virus is produced or targetted. 

Non-native polypeptides according to the invention are 
10 capable of transport through the nuclear membrane. As 
is known from the art, this is an essential feature of 
viral components during replication in a host cell where 
expression of such components takes place in the 
cytosol, and assembly of such components takes place in 
15 the nucleus of the host cell e.g. in adenoviral 
replication. 

The non-native polypeptide of the invention may be 
viewed as performing a dual role or function, firstly as 

2 0 providing a new binding domain (i.e. conferring altered 
tropism) and secondly as functioning as a viral 
component, namely a role as a functional part of a viral 
component (viral protein e.g. viral capsid protein) . 
The non-native polypeptide may thus contribute to or 

25 function as a part of a viral protein. In other words, 
the non-native polypeptide may play a structural role or 
function, in the construction or assembly of a virus 
particle . 

30 Cells comprising, or infected by, the modified virus of 
the invention also fall within the scope of the 
invention. Such cells may be of any origin provided 
that the cell type is capable of harbouring or 
propagating the virus. Generally, however, such cells 

35 will be mammalian cells. Such cells may be transfected 

with a membrane component (e.g. a membrane protein) that 
permits infection of the cell by the modified virus and 



WO 02/08263 



PCT/GB01/03252 



- 31 - 

thus permits propagation of a virus which has a non- 
native tropism in a specific cell line, for example 
where propagation in conventional cell lines known in 
the art is prevented by way of the altered tropism of 
5 the virus. 

Nucleic acid molecules encoding the non-native 
polypeptides, or modified viral component proteins or 
modified viruses of the invention are also envisaged. 

10 Vectors comprising such nucleic acid molecules, or the 
nucleotide sequence encoding the non-native 
polypeptides, or modified viral component proteins or 
modified virus of the invention, either for propagation 
of modified virus or further engineering of modified 

15 viruses are also included within the scope of the . 
invention. 

In a further aspect the invention provides a method for 
producing a modified virus according to the invention in 

20 cell culture. In one embodiment, the method comprises 

the steps of: genetically modifying a virus to produce a 
modified (e.g. recombinant) virus containing a 
non-native polypeptide, which polypeptide comprises one 
or more framework moieties each containing one or more 

25 binding moieties, which polypeptide is capable of being 
expressed in the cytoplasm and nucleus of a human host 
cell and there assuming and maintaining a conformation 
in the absence of a ligand for said binding moieties, 
which allow said binding moieties to subsequently bind 

30 with a said ligand and which polypeptide is capable of 
transport though the nuclear membrane, wherein said 
recombinant virus has an altered tropism conferred by 
said binding moieties/ infecting permissive cells with 
such a virus; culturing said cells to produce the virus 

35 (e.g. at a sufficiently high titre) , and, harvesting, 
and optionally, purifying the modified virus produced. 
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Ad vectors can be made replication competent or 
incompetent for permissive cells. For tumour therapy, 
replication competent Ad has the potential advantage 
that it can replicate and spread within the tumour 
5 (Miller et al., Gene Therapy, 3: 557-559). This may 
theoretically result in an increase of the chosen 
effector mechanism over that obtainable with replication 
incompetent vectors. Furthermore, infectious virus may 
contribute to an anti-tumour effect by cytopathogenic 
10 effects in infected cells as well as by evoking an anti- 
viral immune response which may harm infected cells. 

It is desirable that a means of controlling the 
replication of competent modified virus is available in 
15 gene therapy applications. 

Other means of controlling the replication of modified 
adenovirus are within the scope of the present invention 
For example, the modified virus according to the 

20 invention may comprise a gene encoding a viral protein 

required for viral replication present under the control 
of an inducible promoter or genetic element. For 
example, adenovirus pre-terminal protein (pTP) , may be 
present under the regulation of a tetracycline 

25 responsive transcription activator (trTA) such that the 
pTP is only expressed in the presence of doxycycline. 
Alternatively, the modified virus according to the 
invention may comprise modifications to the genome such 
that the virus only replicates in cells which have a 

30 defect in the DNA synthesis - apoptosis regulatory 
pathways . 

Conveniently this may be achieved in a modified virus of 
the invention by introducing the further feature of a 
35 cleavage site upstream of the binding moiety, or 

positioned in any such location that enables the binding 
moiety to be cleaved from the modified virus, e.g. 
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between the fiber shaft and the binding moiety. 

Thus in a further aspect, the invention also provides a 
method of regulating the replication of a modified virus 
5 comprising the steps of: constructing a modified virus 
such that a cleavage site (e.g. a site susceptible to 
enzymatic or chemical cleavage) is positioned between a 
binding moiety required for cell infection and the 
remainder of the recombinant viral component of which 

10 the binding moiety forms part, and, bringing said 

recombinant virus into contact with a cleavage agent or 
cleavage means (e.g. an enzyme, chemical or in the case 
of a photo-labile cleavage site, light) capable of 
cleaving said binding moiety from said viral component 

15 and thereby preventing the recombinant virus from 
undergoing further infection cycles. 

Where the means of cleavage in the method is enzymatic, 
the cleaving enzyme can be encoded within the genome of 

20 the recombinant virus and can be inducible. In a 

preferred embodiment the cleavage site is cleaved by a 
Factor Xa enzyme. The use of other proteases known in 
the art is also envisaged, for example, the human 
rhinovirus 3C protease, which is available as a fusion 

25 protein with GST (Walker et al., Bio/Technology 12:601 
(1994)). This protease is active at 4°C, and recognises 
and cleaves the sequence LEVLFQ // GP. 

Non-native polypeptides of the invention can be selected 
30 from libraries after screening of such libraries for 

correct nuclear and cytosolic folding of the peptide and 
a desired binding function in a manner similar to phage 
display techniques as known in the art. Such libraries 
may consist of candidate peptides fused to wild type or 
35 modified (e.g. recombinant) viral components according 

to the invention, for example adenoviral fiber proteins. 
A fiber fusion library can be expressed on adenovirions 
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and used to select for correct cytosolic and nuclear 
conformation by observing those candidate fusions which 
facilitate replication . 

5 Modified virus according to the invention may also be 
constructed' with both wild type and modified viral 
components present, or encoded in the genome, and if 
desired, with each component gene under the control of 
different genetic control elements, for example 

10 promoters. Thus, a modified virus according to the 

invention may comprise a modified viral component (i.e. 
modified to include or comprise a non-native 
polypeptide) and an equivalent or corresponding viral 
component which is unmodified, for example a wild-type 

15 fiber, and a modified fiber. 

For example, a recombinant adenovirus can be constructed 
with a wild type fiber and a modified or recombinant 
fiber (i.e. a modified fiber comprising or incorporating 

20 a non-native polypeptide) e.g. a modified fiber derived 
from a 'fiber-candidate binding peptide' fusion library, 
which is expressed under control of a different promoter 
in an El deleted adenovirus. For example, the inventors 
have shown that a recombinant adenovirus fiber gene 

25 under the CMV promoter can be cloned into the multiple 
cloning site of the shuttle vector pAdTrack and that 
viruses can be produced that express both the WT fiber 
and the recombinant fiber by homologous recombination 
using the Ad vector pAdEasy (He T-C, Zhou S, DaCosta LT, 

30 Yu J, Kinzler KW and Vogelstein B: A simplified system 
for generating recombinant adenoviruses. Proc Natl Acad 
Sci, USA, 95:2509-2514, 1998). 

It is also possible to have one or more fiber genes (for 
35 example, a wild type gene and a modified gene) present 
in an adenoviral vector genome under the control of one 
or more inducible promoters, thus allowing each gene to 
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be switched on or off independently. For example, in a 
desired Ad construct, the wild type fiber gene, 
displaced from its normal position and its control by 
the major late promoter (MLP) , can be cloned under 
control of the hormone inducible MMTV promoter, 
inducible by dexamethasone . This promoter allows for 
expression of the wild type fibre in the vector 
propagating cells. preferably, the modified fiber is 
cloned in the same Ad vector genome, for example, 
downstream from a TRE ( tetracyclin-responsive element) 
sequence element that is switched on the in the desired 
cell line. In cells expressing the tTA protein (a 
transcriptional activator which binds to TRE), TRE will 
be activated in the absence of Tc ( tetracyclin) or Dox 
(doxacyclin) . Alternatively and more favourably, Ad 
vector constructs can be propagated in cells which 
express reverse tTA (rtTA) where TRE, and expression the 
modified fiber, is activated in the presence of Tc (or 
Dox) . 

The recombinant fiber proteins in the library can have 
the wild-type fiber knob replaced by an external 
trimerisation motif and one or more members of a peptide 
library (or other non-native polypeptide) . Screening of 
the recombinant fiber proteins for a desired binding 
activity can be carried out in a manner analogous to 
phage display once the recombinant adenovirus has been 
propagated via expression of the wild type fiber in a 
culture of propagator cells. 

Accordingly, modified virus with an altered tropism will 
require cells which it is capable of infecting in order 
to allows its propagation. Thus the invention also 
provides permissive cells for virus according to the 
invention which are capable of being cultured to 
propagate the virus. A replication incompetent virus 
(which generally comprises a deletion in its genome 
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rendering it incompetent) needs special producer or 
propagator cells which are able to supply the genetic 
information that is deleted or missing from the virus, 
in order to replicate. Such cells or cell lines are 
5 known in the art. A modified virus with an altered 

tropism may be unable to infect such cells as are known 
in the art. Accordingly the invention includes the 
modification of such a propagator cell to include a 
binding partner (ligand) for the binding moieties of the 
10 modified virus (e.g. such that the cells express such a 
ligand on thei'r surface) . 

Accordingly the invention provides a cell, preferably an 
in vitro or ex- vivo eukaryotic cell, containing a 
15 modified virus or viral component of the invention. 

It is also envisaged that modified virus with an altered 
tropism might propagated in the same cells as the wild 
type virus by means of positioning the new tropism- 

20 altering binding moieties within a viral component which 
also comprises a wild type binding moiety (i.e. knob), 
with a cleavable site (e.g. for an enzyme such as Factor 
Xa or human rhinovirus 3C protease) between such 
moieties. After propagation in conventional cells via 

25 action of the wild type binding moiety, the viral 

component may be cleaved to remove the wild type binding 
moiety (e.g. the wild- type knob) and to reveal the new 
binding moiety and to confer the new tropism. It is 
also envisaged that the revealed binding moiety can be 

30 specific for a ligand which itself possesses a cell 

binding capability. For example, the binding moiety can 
be specific for the Fc region of an antibody, thereby 
permitting the use of an antibody which is cell specific 
for a desired cell type. Removal of the wild type 

35 binding moiety by cleavage and exposure of the virus to 
a cell specific antibody, allows the antibody to be 
adsorbed to the virus and thus targets the virus to the 
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cells specified by the antibody. 

Non-native polypeptides according to the invention 
preferably confer an altered tropism to the recombinant 
5 virus in comparison with the tropism of the virus to 
which the non-native polypeptide has been introduced. 
Such altered tropisms allow recombinant viruses of the 
invention to be used in treatment of disease in human or 
animal subjects, either by in vivo treatment of, or ex 
10 vivo treatment of cells of, the subject requiring 
treatment. 

The tropism of the modified virus of the invention may 
be altered such that the virus, via one or more binding 

15 moieties targets particular cells. Thus the invention 
also provides a modified virus wherein a non-native 
polypeptide comprises a binding moiety capable of 
binding to a cell specific ligand which may optionally 
be Prostate Specific Membrane Antigen, EGF receptor, 

20 Her-2/Neu, VEGF receptor, CD22, gpl20, MHC/peptide 

complexes or membrane structures or surface molecules 
expressed or present on proliferating cells, tumor cells 
or virus infected cells 

25 There are many ways in which modified virus exhibiting 
an altered tropism according to the invention can be 
used in gene therapy applications. In the case of 
tumour diseases, the following options exist: 
I. Use of vectors to introduce transgenes into tumours, 

3 0 such as: 

- antisense oncogenes 

- suicide genes 

- genes for immune modulatory substances or tumour 
antigens 

35 - genes for ant i -angiogenic factors 
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Examples of transgenes suitable for inclusion in the 
modified virus of the invention are antisense oncogenes, 
suicide genes, genes for immune modulatory substances, 
genes for tumour antigens, genes for anti -angiogenic 

5 factors, cytokines, genes for vascular endothelial 
growth inhibitors, genes for fusogenic membrane 
glycoproteins, cytotoxic genes, a gene encoding an 
enzyme which converts a pro-drug to cytotoxic substance, 
a gene for cytosine deaminase, a gene for uracil 

0 phosphoribosyl transferase. In a preferred embodiment 
of the invention the modifed virus comprises transgenes 
encoding cytosine deaminase and uracil phosphoribosyl 
transferase either as separate genes or, more 
preferably, together as a bifunctional fusion gene 

In methods of treatment employing viruses of the 
invention with the transgenes cytosine deaminase and 
uracil phosphoribosyl transferase, preferably the virus 
is present or is co-administrated with one or more 
) inhibitors of dihydropyrimidine dehydrogenase. This has 
the effect of increasing the toxic effect of the 
products formed by these transgenes from the pro-drug, 
5 - Fluorocytosine . 

! Also provided by the invention is a modified virus of 
comprising one or more viral components or transgenes 
present under the control of one or more inducible 
promoters or genetic elements, which are for example 
tissue specific or which respond to the presence of an 
exogenous expression modulating substance. 

II. Use of infectious virus. This has the added value 
over the use of non replicating vectors in that virus 
can spread from cell to cell within a tumour, thereby 
multiplying the initial hit, or effect, on the tumour. 
Tumour cell destruction can occur not only by the 
cell -destroying mechanism engineered into the vector but 
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also by the cell destruction which is associated with 
the virus infection per se and by the attack of the 
body's immune response on the virus infected cells. 
This principle has already been tested in man through 
5 the direct intra- tumoural injection of an adenovirus 

which has been made gene manipulated to replicate only 
in p53 mutant tumour cells. The experience from these 
limited trials on large "head-and-neck" tumours are 
partially encouraging with a total regress of 2/11 
10 treated tumours which are otherwise resistant to any 
form of known treatment (Shen Y: Personal 
communication) . 



Al so provided by the invention is a modified virus 
15 wherein the gene encoding adenovirus death protein (ADP) 
is placed under the control of a promoter permitting 
over-expression of the protein, such that the lytic 
capacity of said modified virus is increased. 

20 Thus, for gene therapy the modified virus may further be 
modified, according to standard techniques, principles 
and proposals widely described in the literature, to 
incorporate or comprise a desired therapeutic gene or 
therapeutic nucleic acid molecule. "Therapeutic" is 
25 used broadly herein to include both therapy (in the 
sense of curative or palliative therapy of a pre- 
existing or diagnosed condition) and prophylaxis. The 
gene may encode a desired therapeutic product (e.g. a 
therapeutic polypeptide or antisense molecule) . 

30 

In a further aspect of the invention, the modified virus 
may include a site (e.g. a restriction site) for 
insertion of a desired therapeutic gene/nucleic acid 
molecule . 



One disadvantage of using a wild type virus in gene 
therapy, even after modification according to the 
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invention, is that the human or animal subjects to be 
treated with the virus may possess pre-formed antibodies 
to components of the wild type virus still present or 
only partially modified in the modified virus. For 
5 example human adenovirus serotype 5 hexon protein is 
often targetted by pre-formed antibodies. Whilst 
modification of a single viral component, for example 
the component containing the non-native polypeptide 
according to the invention, may be sufficient to reduce 
10 the initial immune response to the modified virus via 

pre-formed antibodies, it may be preferable to modify a 
further component of the virus to further reduce the 
immune response of the host . 

15 Thus the invention further provides a modified virus 
according to the invention which further comprises a 
viral component which is replaced with an equivalent 
component or is modified such that binding of said virus 
by antibodies pre-formed to the wild type virus is 

2 0 reduced. For example, an adenovirus of serotype Ad5 may 

comprise a hexon protein which is swapped for a hexon 
protein of a different serotype against which pre-formed 
antibodies are present at a reduced level compared to 
serotype Ad5, for example the different serotype may be 
25 Ad37. Alternatively, a modified adenovirus may comprise 
epitope sequences of the hexon protein to which 
pre-formed antibodies bind which are modified to produce 
a recombinant hexon lacking immunogenic epitopes. 

3 0 Furthermore, the modified adenovirus may comprise a 

hexon protein which further comprises a peptide capable 
of binding a protein, for example, a non- immune system 
protein such as human serum albumen, sufficiently to 
cover the immunogenic epitopes of the hexon which are 
35 bound by pre-formed antibodies. 

Also provided by the invention is a modified virus 
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according to the invention for use in therapy or in the 
preparation of a medicament for the treatment of tumour 
cells or proliferating cells. 

5 Additionally provided by the invention is a 

pharmaceutical composition comprising a modified virus 
of the invention and a pharmaceutically acceptable 
carrier or excipient . 

10 The invention will now be described in more detail with 
reference to the following non-limiting Examples, in 
which : 

Figure 1 shows a schematic descript ion of the sequence 
15 of construction of different recombinant fibers. 

A. The NRP sequence is supplied with flanking Sphl and 
Xhol sites using PCR and ligated into WT fiber which has 
been supplied with flanking EcoRl and Xhol sites. The 
resulting fiber is called Al and contains the fiber 

20 tail, first shaft repeat and the NRP motif. 

B. EGF is joined to the fiber Al by SOE. In the 
process an amino acid linker and a Clal restriction site 
is added between the NRP and EGF sequences. The 
resulting fiber is named Al EGF. In this fiber, EGF can 

25 be substituted for new ligands by ligation using the 
Clal and Xhol sites. 

C. The NRP -Linker- EGF part of fiber Al EGF is 
subjected to PCR. In the process an upstream Nhel site 
is introduced into the sequence in frame with the WT 

30 fiber sequence. After AT cloning, the sequence is 
ligated into WT fiber using Nhel and Xhol to create 
fiber A7 EGF. A7 differs from Al in that A7 contains 
the first seven shaft repeats of the Ad5 fiber. In the 
A7 EGF construct, EGF can be substituted for new ligands 

35 by ligation using the Clal and Xhol sites. 

Figure 2 shows a schematic representation of the 
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different recombinant fibers used in the present 
application . 

A, The Ad5 wild type fiber. 

B. The fiber A7 EGF described above. 

5 C. The fiber A7 scFv C242 obtained by substituting EGF 
in fiber A7 EGF for the scFv fragment by ligation as 
mentioned above. 

D. The fiber A7 scFv G250 obtained by substituting EGF 
in fiber A7 EGF for the scFv fragment by ligation as 

10 mentioned above. 

E. The fiber A7 Affi IgG where EGF in fiber A7 EGF has 
been substituted for an IgG binding affibody as 
mentioned above. 

F. The fiber A7 Affi IgA where EGF in fiber A7 EGF has 
15 been substituted for an IgA binding affibody as 

mentioned above. 

G. G. The fiber A7 Affi IgG/Affi IgA where EGF in 
fiber A7 EGF has been substituted for an IgG binding 
affibody linked to an IgA binding affibody as mentioned 

2 0 above . 

H. The fibre A7 ZIgG/ZIgG where EGF in fibre A7 EGF 
has been substituted for an IgG binding affibody linked 
to another IgG binding affibody as mentioned above. 

I. The fibre A7 ZIgG Xa Knob where EGF in fibre A7 EGF 
25 has been substituted for an IgG binding affibody linked 

to a cleavable wild type fibre knob. 

Figure 3 shows the binding of ZIgG affibody when 
incorporated into fibers expressed on virions. 

30 Panel A . After binding of virus or protein to the 

membrane, it was incubated with Fc3(l) followed by HRP 
conjugated anti human IgG and developed. 1 = the virus 
A7 ZIgG Xa Knob; 2= the same virus after cleavage with 
Xa; 3 = WT virus; 4 = virus with two fibers e.g. WT and 

35 A7 ZIgG; 5 = Fc3(l); 6 = Protein A; 7 = Protein AG. 
Panel B . After binding of virus or protein to the 
membrane, it was incubated with Fc3 followed by HRP 
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conjugated anti human IgG and developed. 1 = the virus 
A7 ZIgG Xa Knob; 2= the same virus after cleavage with 
Xa; 3 = WT virus; 4 = virus with tv/o fibers e.g. WT and 
A7 ZIgG; 5 = Fc3 ; 6 = Protein A; 7 = Protein AG. 
5 Panel C . Control to show presence of virus on membrane. 
1 = the virus A7 ZIgG Xa Knob; 2= the same virus after 
cleavage with Xa; 3 = WT virus. After binding of virus 
the membrane was incubated with HRP conjugated anti Ad5 
hexon and developed. 
10 Panel D , Treated as membranes in panel A and B but the 

incubation with Fc was omitted. 1 = the virus A7 ZIgG Xa 
Knob; 2 = WT virus; 3 = protein A; 4 = protein AG. 

Explanatory note: Fc3 (1) is known to bind ZIgG and 
15 protein A. Fc3 is known to bind protein G (AG) only. 

Figure 4 shows a schematic representation of the fibers 
described in Example 11. The NRP-linkers were inserted 
in the Nhel site upstream of NRP. 

20 

Figure 5 shows gel photos demonstrating expression and 
solubility of recombinant fibers in Sf9 cells. (A), 
whole cell lysate; (B) , soluble fraction. 
Baculovirus- infected cells were harvested at 48 h after 

25 infection, lysed in isotonic buffer, and cell lysates 

divided in two aliquots. One aliquot was centrifuged at 
10,000 x g for 10 min and supernatant was kept (panel 
B) , whereas the other was analysed as whole cell lysate 
(panel A) . Both aliquots were heat -denatured in 

3 0 SDS- sample buffer, analysed by conventional SDS-PAGE, 

and blotted. Blots were reacted with 4D2 . 5 mAb (specific 
for the fiber tail) and radiolabeled secondary antibody. 
Immunoblots were quantitated by autoradiogram scanning. 
Quantitative data, expressed as the percentage of 

35 soluble versus total fiber content, are shown in Table 
7. 
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EXAMPLES 

General procedures and starting materials , 
Recombinant adenovirus fibers were constructed using 
5 methodologies based on ligation and PCR (Clackson et 
al . , General application of PCR to gene cloning and 
manipulation, in PCR, A Practical Approach, Eds 
McPherson MJ, Quirke P and Taylor GR, IRL Press, Oxford, 
page 187, (1992)), i.e. PCR-ligation-PCR (Alvaro et al . , 

10 BioTechniques 18: 746-750 (1995)) and splicing by 

overlap extension (SOE) (Horton et al . , Recombination 
and mutagenesis of DNA sequences using PCR, in McPherson 
MJ (ed) , Directed Mutagenesis, IRL Press 1991, p 217.). 
Gene products generated by PCR were generally cloned 

15 into the vector pCRII (Invitrogen Corp.) using so called 
TA cloning (Clark J.M., Nuc . Acids Res. 16: 9677-86, 
(1988) ) . Subclonings were performed according to 
standard methods (Sambrook et al . , Molecular cloning. A 
laboratory manual. Second Edition. Cold Spring Harbor 

20 Laboratory Press, (1989)) in the vector pGEX-4T-3 
(Amersham Pharmacia Biotech) . Genes encoding 
recombinant fibers were sequenced using the Perkin Elmer 
ABI Prism sequencing equipment and were expressed in 
mammalian cells (SV4 0 transformed African Green monkey 

25 kidney cells, COS7, obtained from American Type Culture 
Collection, VA, USA) using vectors described below, and 
in insect cells (Sf9 cells from S. frugiperda obtained 
from American Type Culture Collection, VA, USA) using 
Baculovirus expression (Kitts et al . , Biotechniques 

30 14(5): 810-7, (1993)) (virus and vector from Clontech, 
Palo Alto, CA, USA) and stained with monoclonal 
antibodies specific for fiber tail, trimeric fiber and 
the new cell binding ligand. The following parameters 
were evaluated by immunostaining : 
35 i) nuclear transportation 

ii) functional expression of the new cell binding 
ligand 
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iii) ability to form trimers 

Recombinant fibers were rescued into the Ad genome by a 
recently developed procedure (as described in Example 7 
5 herein). The plasmid pTG3602 (Chartier et al . , J. 

Virol., 70: 4805-4810, (1996)) containing the entire Ad5 
genome as a Pacl-Pacl fragment was used as starting 
material. The approximately 9kb fragment of the genome 
between Spel and Pacl and containing the wild type fiber 

10 gene was cloned separately in pBluescript. From this 

fragment an approximately 3kb fragment between Sacl and 
Kpnl was further subcloned. A deletion of the native 
fiber gene with the exception of the N-terminal 
nucleotides upstream of the Ndel site of the fiber was 

15 created in the 3kb fragment and an Xhol site introduced 
in its stead allowing for ligation of recombinant fibers 
into the fiber-deleted 3kb fragment (the 3 kb fiber 
shuttle) between Ndel and Xhol. 

20 The 3 kb fiber shuttle with recombinant fiber was 

re -introduced into the 9 kb fragment cut with Nhel using 
homologous recombination in E.coli (Chartier et al . , 
Supra) . The resulting recombinant 9 kb fragment was 
finally excised from the vector with Spel and Pacl and 

25 joined to the isolated 2 7 kb fragment by Cosmid cloning. 

The presence of an insert of the expected properties was 
verified in all cosmid clones by PCR. Cosmid clones 
were also restricted with Hind III and the presence of 
30 restriction fragments of the expected size verified on 
gels . 

Recombinant Ad genomes were isolated after restriction 
with Pac 1 and used to transfect suitable cells. The 
35 occurrence of plaques was determined by microscopic 
inspection of the transfected cell cultures. 
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List of oligonucleotide primer names for the primers 
used (all given as 5 1 -3 1 sequence): 

(SEQ. ID. NO. : 20] Name 14 9: TTCCTCGAGTTATTCTTGGGCAATGTATGA 
[SEQ. ID. NO. : 21] Name 175: GGGGAATTCGATGAAGCGCGCAAGACCGTCTGAA 

5 [SEQ. ID. NO. : 22] Name: 196: GCTCGAGTTATCCGTTTGGAAACAACTCTAC 
[SEQ. ID. NO.: 23] Name: 228: CTCGAGTCATCTCAATTCCCACCACTT 
[SEQ. ID. NO.: 24] Name: 238: TGGCATGCCTGACGTAGCAAGCTTACGA 
[SEQ . ID. NO.: 25] Name: 253: 
GGGGAATTCATCGATGCAGGTCCAGTTGGTGCAGTCT 

0 [SEQ. ID. NO.: 26] Name: 265: CAGGTCCAGTTGGTGCAGTCT 
[SEQ. ID. NO. : 27] Name: 269: 

GGGGGCCTGGGCGTCGTTCAGCTTCTTGGCTCCGTTTGGAAACAACTCTAC 
[SEQ. ID. NO. : 28] Name: 270: 

CTGAACGACGCCCAGGCCCCCAAGAGCGACCCATCGATCATGAACTCCGACTCCGAATGT 
5 [SEQ . ID. NO. : 29] Name: 273: 

CCCCTGGAGTTAAATTTTCTTGTCCACCTTGGTGCT 
[SEQ. ID. NO. : 30] Name: 274: 

GGGGAATTCATCGATGGACTACAAAGATATTGTGATGACGCAGGCT 

[SEQ. ID. NO. : 31] Name: 275: CTACCTCGAGTTAACACTCATTCCTGTTGAAGC 
0 [SEQ. ID. NO. : 32] Name: 326: GGGGCTAGCCCCTGACGTAGCAAGCTTACGA 

[SEQ. ID. NO. : 33] Name: 403: GGG CTC GAG TTA CTC GAT GGG GGC TGG 
GAG GGC 

[SEQ. ID. NO. : 34] Name: 414: 

GGCCCCCGAGGCCTCGAGTGAGGAGACGGTGACCGTGGT 
5 [SEQ. ID. NO. : 35] Name: 416: 

GGCCCAGCCCACGAATTCATCGATGGATATTGTGATGACGCAGGCT 

[SEQ. ID. NO. : 36] Name: 418: AGA CTG CAC CAA CTG GAC CTG 

(SNN) 18 CCGTTTCAGCTCCAGCTTGGT (S is dA, dG or dC and N is dA, dG, 

dC or dT) 

0 [SEQ. ID. NO. : 37] Name: 473: GGC AAT TCC ATC GAT CGC CAC CAT GGA 
CAT TGT GAT GAC CCA GTC T 

[SEQ. ID. NO. : 38] Name: 474: CCC CTC GAG TTA ACA CTC ATT CCT GTT 
GAA GCT 

[SEQ. ID. NO. : 39] Name: 476: ACC ACG GTC ACC GTC TCC TCA GCT GAT 
5 GCT GCA CCA ACT GTA 

[SEQ. ID. NO. : 40] Name: 478: TGA GGA GAC GGT GAC CGT GGT 

[SEQ. ID. NO. : 41] Name: 503: GGGCCATCGATCGTAGACAACAAATTCAACAAA 
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(SEQ. ID. NO.: 42] Name: 504: GGGCTCGAGTTATTTCGGCGCCTGAGCATCATT 
[SEQ. ID. NO. : 43] Name: 550: 

TCGGTTTGGAAACAACTCTACCTTTTTTTTCGGCGCCTGAGCATCATT 
[SEQ. ID. NO. : 44] Name: 551: 
5 AAAAAGGTAGAGTTGTTTCCAAACGGAGTAGACAACAAATTCAACAAA 

EXAMPLE 1: 

Genetic insertion of a t rimerisat ion motif ( the neck 
10 region peptide from human lung surfactant D) into 
adenovirus f ib'ers . 

The gene encoding Ad 5 WT fiber was obtained from a 
preparation of Ad5 virus by PCR using an upstream primer 

15 (Primer 175) identical to the first six coding triplets 
of the fiber plus an EcoRl site and a downstream primer 
(Primer 149) annealing to the six terminal coding 
triplets of the fiber plus an Xhol site. The fiber thus 
obtained [SEQ. ID. NO. : 1] was cloned into the vector 

20 pBluescript using these restriction sites and can be 
further sub-cloned into other vectors using the same 
restriction enzymes. 

Fiber peptides were made where the knob was replaced 
25 with an external trimerisation motif (see below) . The 
purpose behind the introduction of an external 
trimerisation motif is two-fold: a) to remove the knob 
containing the native trimerisation signal but also the 
cell binding part of the fiber, and b) simultaneously to 
30 supply the necessary trimerisation signal. ■ In this case 
one particular amino acid motif have been used, i.e. the 
36 aa "Neck Region Peptide" = NRP [SEQ. ID. NO. : 2] from 
human "Lung Surfactant Protein D" (Hoppe et al., Supra). 
It should be noted that the sequence used is slightly 
35 longer than the actual trimerisation part of NRP in that 
the eight amino acids (KKVELFPN) following the 
trimerisation signal in human lung surfactant protein D 
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has been retained in all constructs containing the NRP 
sequence. The sequence KKVELFPN functions as an 
efficient linker between the trimerisation signal and 
the C-terminal carbohydrate domain of the human lung 
surfactant D and is considered to have the same 
important function in the recombinant fibers described 
herein. The DNA sequence coding for the trimerisation 
motif was synthesized, cloned and verified by 
sequencing . 

In order to introduce the NRP motif into the adenovirus 
fiber, the NRP sequence was subjected to PCR (Clackson 
et al., Supra) with the upstream primer 238 containing 
Sphl N-terminally of the NRP coding sequence and the 
downstream reverse primer 196 containing Xhol 
C-terminally of the coding sequence. After cutting with 
Sphl and Xhol the NRP sequence was ligated into the WT 
fiber gene cut with the same enzyme. The resulting 
recombinant fiber Al [SEQ. ID. NO. : 3] contains the 
fiber tail and the first shaft repeat followed by the 
NRP trimerisation motif. For a schematic representation 
of the constructions and construction pathways see Figs 
1 and 2. 

To replace the cell binding function of the knob a new 
cell binding ligand was subsequently introduced into the 
fiber in addition to the external trimerisation amino 
acid motif (see below) . 

EXAMPLE 2_l 

Assembly of gene construct encoding recombinant 
adenovirus fibers with epidermal growth factor (EGF) and 
the external trimerisation motif from human lung 
surfactant D. 

For a schematic representation of the constructions and 
construction pathways see Figs 1 and 2. 
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The DNA sequence for human EGF [SEQ. ID. NO. : 4] was 
synthesized, cloned and sequenced in the project. This 
sequence was then joined to the Al fiber mentioned above 
by splicing by overlap extension. In this case the EGF 
5 gene was subjected to PCR with an upstream primer (270) 
(identical to the first seven coding triplets) and 
containing an overhang with the sequence for an amino 
acid linker [SEQ. ID. NO. : 5] derived from 
Staphylococcal protein A and a Clal restriction site, 

10 and a downstream primer (228) (complementary to the 

seven terminal; triplets of the + strand and containing 
an Xhol site) . The Al fiber gene was subjected to PCR 
with an upstream primer identical to the first six 
coding triplets of the gene and an EcoRl site (175) and 

15 a downstream primer (269) complementary to the seven 

terminal coding triplets of the + strand and an overhang 
complementary to the overhang in Back primer for the EGF 
sequence. The two PCR products where then joined by PCR 
under standard SOE conditions (Horton et al., Supra) to 

20 produce fiber Al EGF [SEQ. ID. NO. : 6]. 

In order to construct a fiber with the first seven shaft 
repeats, the NRP trimerisation signal, the 
Staphylococcal linker and EGF, the fiber Al EGF was 

25 subjected to PCR with an upstream primer (326) identical 
to the first seven 5 1 triplets of the NRP sequence plus 
an upstream Nhel site and a downstream primer (228) 
complementary to the seven terminal triplets of the Al 
plus strand. After cloning the PCR product was 

30 restricted with Nhel and Xhol and ligated into WT Fiber 
restricted with the same enzymes to obtain Fiber A7 EGF 
[SEQ. ID. NO. : 7] which is similar to Al EGF but differs 
in that it contains the first seven shaft repeats of the 
Ad5 Fiber. 

35 

Example 3: 

Assembly of gene construct encoding recombinant 
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adenovirus fibers with single-chain antibodies and the 
external trimerisation motif from human lung surfactant 
CL_ 

5 Two monoclonal antibody single chain fragments were used 
to construct recombinant adenovirus fibers. The first 
is a single chain fragment (scFv) of the monoclonal 
antibody G250, which with high selectivity has been 
shown to react with a protein antigen on human renal 
10 carcinoma cells (Oosterwijk et al., Int. J. Cancer 38: 
489-94, (1986)). The second is a single chain fragment 
of the monoclonal antibody C242 which reacts with i.a. 
colorectal and pancreatic carcinomas (Johansson C, 
Thesis, University of G N teburg, (1991)). 

15 

G250 constructs 

The single chain fragment (Variable kappa chain or VK, 
linker, variable heavy chain or VH, joining seguence and 
constant heavy domain 2 or CH2) of the antibody G250 was 

20 constructed as previously described (Weijtens et al . , 
J.Immunol., 152(2): 836-43, (1996)). ThisG250 
construct is [SEQ. ID. NO.: 8]. To permit cloning into 
the aforementioned Al and A7 fiber constructs, the 
single chain fragment was supplied with an upstream Clal 

25 site and a downstream Xhol site by PCR using primers 416 
and 403. 

C242 constructs 

The single chain fragment of the antibody C242 (Variable 
3 0 kappa, Linker, variable heavy and constant kappa or CK) 
[SEQ. ID. NO. : 9] was constructed as follows by SOE 
using cDNA from the antibody producing hybridoma as 
original templates. VKCK was amplified using primers 
274 and 275, VHCH1 was amplified using primers 253 and 
35 273. An scFv (single chain variable fragment) (VK 

LinkLib VH) was constructed by SOE as follows. VK and 
VH were amplified separately using primers 416/418 and 
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265/414 respectively with the above mentioned VKCK and 
VHCH1 as templates and joined together by SOE using 
primers 414 and 416. In the construct VK LinkLib VH, 
the linker between VK and VH is a randomized 18 amino 
5 acid sequence as described previously (Tang et al . , J. 
Biol. Chem., 271: 15682-86, (1996)). The nucleotide 
sequence for this linker is present in primer 418. The 
construct VK LinkLib VH was cloned into the vector 
pAKlOO (Krebber et al., J. Immunological Methods 201: 35- 

10 55, (1997)). Phage display and selection of antigen 

binders by pan'ning was performed using methods described 
earlier (Krebber et al., Supra). In the present 
experiments, the CanAg antigen, reacting with antibody 
C242, was adsorbed onto biotinylated antibody C241 

15 (which binds another epitope on the antigen than C242) 
bound to streptavidin coated tubes (CanAg Diagnostics 
Ltd, G'teburg, Sweden) . Several binders were isolated and 
shown to contain different linkers. A particular VK 
Linker VH construct shown by sequencing to contain the 

20 linker PPDFVPPAASFPDHSPRG (one letter amino acid code) 
was selected for further work based on antigen binding 
ability. CK was linked to this construct by SOE to 
obtain the format C242 VK Link VHCK. In this SOE the VK 
LinkLib VH was amplified using primers 416 and 478 and 

25 the CK amplified with primers 476 and 474. The 

amplified products were then joined by SOE using primers 
416 and 474. 

In the PCR reactions mentioned above the gene sequences 
30 for the single chain fragments were supplied with an 

upstream Clal site (present in primers 416 and 473) and 
a downstream Xhol (present in primers 403 and 474) to 
allow for ligation into the Al and A7 fiber constructs 
mentioned earlier (for a schematic representation of the 
35 constructions and construction pathways see Figs 1 and 

2) to construct the fibers Al G250, A7 G250, Al C242 and 
A7 C242 (Fig 1 and [SEQ I.D. NOS. 10-13]). 
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Example 4 : 

Assembly of gene constructs encoding recombinant 
adenovirus fibers with affibodies and the external 
trimerisation motif from human lung surfactant D. 

5 

To investigate if mono and divalent antigen binding 
structures based on the structure of single 
staphylococcal protein A domains could be functionally 
expressed when incorporated into recombinant Adenovirus 

10 fiber, gene constructs for mammalian and insect cell 
expression were made. Assembly of gene constructs 
encoding recombinant adenovirus fibers containing the 
IgG binding Z domain (ZIgG) derived from staphylococcal 
protein A (NilssonB., Prot . Eng. 1:, 107-13, (1987)) 

15 [SEQ. ID. NO. : 14] or a Z domain-derived IgA-specific 
affibody (ZIgA) selected using phage display 
(Gunneriusson E. et al., App. Env. Micro., 65: 4134-40, 
(1999)) [SEQ. ID. NO. : 15] was accomplished as follows. 

20 The genes encoding the respective affinity moieties were 
amplified by PCR using primers 503 and 504 on the 
following plasmid templates; pEZZmpl8 (Tang et al., 
Supra) for the Z domain construct and pKNl-dZIgA 
(Clackson et al., Supra) for the ZIgA construct. 

25 

In the PCR amplification, the genes for the two 
different affinity ligands were supplied with an 
upstream Cla I site and a downstream Xho I site in the 
appropriate reading frame for subsequent ligation into 

30 the above described fiber gene A7 EGF resulting in 

constructs encoding the recombinant fiber A7 ZIgG [SEQ. 
ID. NO.: 16] and A7 ZIgA [SEQ. ID. NO.: 17], 
respectively, which were adapted for later being rescued 
into the Ad genome (see below) for the production of 

35 recombinant viruses carrying the new binding 

specificities. Furthermore, the fiber Al ZIgG was 
constructed by ligation of the modified ZIgG into the 
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aforementioned fiber Al EGF (see also Fig 1) . 

In addition, a third gene construct was assembled 
encoding a fiber containing both of the above mentioned 
affinity domains. This construct encodes a fiber 
containing the fiber tail, the first seven shaft 
repeats, the NRP ■ sequence f the staphylococcal protein A 
linker, the IgG binding Z domain, the eight amino acid 
linker from NRP (KKVELFPN) followed by the IgA binding 
affibody. To construct this fiber the two different 
affinity domaihs were first genetically joined together 
by SOE using primers 550 and 551 with overhangs 
complementary to the nucleotide sequence encoding the 
linker sequence KKVELFPN. In the PCR process an 
upstream Clal site and a downstream Xhol site were 
introduced by the primers allowing for ligation into the 
vector pGEX-4T-3 containing the Fiber A7 EGF gene 
construct to obtain Fiber A7 ZIgG/ZIgA [SEQ. ID. NO. : 
18] . 

A further gene construct was also assembled to encode a 
fiber containing two linked ZIgG domains. This gene 
codes for the fiber A7 ZIgG /ZIgG [SEQ. ID. NO. : 19] and 
was assembled exactly as described for A7 ZIgG /ZIgA 
with the exception that the gene for ZIgG was used 
instead of ZIgA in the PCR reaction. 

For a schematic representation of the constructions and 
construction pathways see Figs 1 and 2. 

EXAMPLE 5: 
Binding studies 

The genes encoding recombinant fibers were cloned into 
the vectors pcDNA (which targets proteins for expression 
in the cytosol) and pSecTag (which targets proteins for 
expression as secreted products), both from Invitrogen 
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BV, Groningen, The Netherlands, and transfected into 
COS7 cells using Lipofectamin (Life Technologies Inc, 
Gaithersburg, MD, USA) as described by the 
manufacturers . 

Expression and cellular localization of recombinant 
fibers were evaluated by immunostaining using the 
following primary reagents: 



10 Mouse monoclonal antibody 4D2.5 (anti-Ad5 fiber) (kindly 
provided by Dr Geoffrey Engler, University of 
Birmingham, Alabama, USA) (Shin Hong et al., Virology 
185: 758-767, (1991) ) . 

Mouse monoclonal antibody 2A6.36 (ant i-trimerised Ad5 
15 fiber) (kindly provided by Dr Geoffrey Engler, 

University of Birmingham, Alabama, USA) (Shin Hong et 
al . , Supra ) . 

Mouse monoclonal antibody against Epidermal Growth 
Factor (EGF) = a-EGF (Cambio, Cambridge, UK, Cat no CA 
20 954) . 

Biotinylated mouse monoclonal anti idiotypic antibody 
directed against monoclonal antibody C242 = a-Id C242 
(Lindholm et al., unpublished results). 
Biotinylated mouse monoclonal anti idiotypic antibody 

25 directed against monoclonal antibody G250 = a-Id C250 
(Kindly supplied by Reinder Bolhuis, Daniel Den Hoed 
Cancer Center, Rotterdam, The Netherlands) . 
Human polyclonal IgG and IgA for evaluation of Affibody 
activity = HIgG (Sigma-Aldrich Fine Chemicals, Cat no 

30 14506) and HIgA (Sigma-Aldrich Fine Chemicals, Cat no 
11010) . 

Secondary reagents were: 

For identification of mouse antibodies: FITC labelled 
35 F(ab)2 rabbit anti mouse immunoglobulin (DAKO A/S, 
Glostrup, Denmark, Cat no F0313) = aMIg. 
For identification of human IgG: FITC labeled F(ab)2 
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no F0315) = aHIgG. 

For identification of human IgA: FITC labeled F(ab)2 
rabbit anti human IgA (DAKO A/S, Glostrup, Denmark, Cat 
5 no F0316) = aHIgA. 

For identification of biotinylated antibodies: FITC 
labeled Streptavidin (DAKO A/S, Glostrup, Denmark, Cat 
no F0422) . 

10 Briefly, cells were centrifuged onto microscope slides 

in a Shandon C'ytospin2 cytocentrif uge and air-dried over 
night at room temperature. The preparations were fixed 
in 3% paraformaldehyde in phosphate buffered saline, pH 
7,4 (PBS), permeabilized with 0,1% Triton-XlOO in PBS. 

15 After washing in PBS, preparations were incubated with 
primary reagents for 30 minutes at 37°C in a humid 
chamber, washed again in PBS and incubated with 
secondary reagent for 30 minutes at 37°C in a humid 
chamber. After washing in PBS, preparations were 

20 mounted in PBS with 50% glycerol and viewed in a Zeiss 
Axophot microscope equipped with appropriate light 
source and filters for FITC. 

The results are shown below in Tables 2 and 3. It is 
25 obvious that all of the different ligands show 

appropriate binding when the corresponding fibers were 
expressed as secreted products. However, only the 
affibodies show the expected correct binding when the 
fibers were expressed in the cytosol and subsequently 
30 transported to the nucleus. Therefore, not all ligands 
can fold correctly in the cytosol and nucleus. It is 
interesting that the ligands which can withstand the 
milieu in the cytosol and nucleus (affibodies) are small 
a-helical structures not depending on S-S bridges for 
35 their conformation whereas the ligands which were shown 
not to be properly expressed in the cytosol/nucleus all 
have a conformation that is dependent on the formation 
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of S-S bridges which are formed very poorly or not at 
all in the cytosol. 



Table 2 : Expression and functional ligand binding in 
recombinant adenovirus fibers targeted for secretion in 
COS cells 



Detecting reagent 
10 Fiber 4D2+ 2A6+ a-EGF+ 

aMIg aMIg aMIg 



a-MIg a-Id+ HigG+ HigA+ 
aMIg aHIgG aHIgA 





WT 




+ 


+ 


ND 


ND 


ND 


ND 


ND 




A7 


EGF 


+ 


+ 


+ 


ND 


ND 


ND 


ND 


15 


A7 


G250 


+ 


+ 


ND 


+ 


+ 


ND 


ND 




A7 


C242 




+ 


ND 


+ 


+ 


ND 


ND 




Al 


ZIgG 


+ 


ND 


ND 


ND 


ND 


-f 






A7 


ZIgG 


+ 


ND 


ND 


ND 


ND 


+ 






A7 


ZIgG/ZIgG 


+ 


ND 


ND 


ND 


ND 


+ 




20 


A7 


ZIgA 




ND 


ND 


ND 


ND 




+ 




A7 


ZIgG/ZIgA 


+ 


ND 


ND 


ND 


ND 
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Table 3: Expression and functionality of ligand binding 
in the nucleus of COS cells in native and selected 
recombinant fibers after targeting for expression in the 
cytosol 

Detecting reagent 



10 



15 



20 



Fiber 


4D2 + 


2A6+ 


a-EGF+ 


a-MIg 


a-Id+ 


HigG+ 


HigA+ 




aMIg 


aMIg 


aMIg 




aMIga 


HIgG 


aHIgA 


WT 


+ ' 


+ 


ND 


ND 


ND 


ND 


ND 


A7 EGF 


-f 


+ 




ND 


ND 


ND 


ND 


A7 G250 


+ 


+ 


ND 


+ 




ND 


ND 


A7 C242 


+ 


+ 


ND 


+ 




ND 


ND 


Al ZIgG 


+ 


ND 


ND 


ND 


ND 


+ 




A7 ZIgG 


+ 


+ 


ND 


ND 


ND 


+ 




A7 ZIgG /ZIgG 


+ 


ND 


ND 


ND 


ND 






A7 ZIgA 


+ 


+ 


ND 


ND 


ND 




+ 


A7 ZIgG/ZIgA 


+ 


ND 


ND 


ND 


ND 






A7 ZIgG Xa Knob 


+ 


ND 


ND 


ND 


ND 







The results have obvious implications for the 
construction of those re-targeted virus for human gene 
therapy where the viral structural components containing 
25 the new cell binding ligand are synthesized in the 

mammalian cell cytosol , i.e. adenovirus. Below are two 
enabling examples to show how such re-targeted 
adenovirus can be constructed. 

3 0 EXAMPLE 6r 

Grafting of CDR loops 



Certain single chain constructs of monoclonal antibodies 
retain their binding specificity even in the mammalian 
35 cell cytosol (Cattaneo et al-, TIBTECH 17: 115-121, 

(1999)). This is a function of the so called frame work 
regions of the antibody variable regions. It is known 
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that the antigen binding CDR loops can be transferred 
from one antibody to another by recombinant DNA 
technology thereby creating antibodies with frame work 
properties from one antibody and binding properties from 
5 another (for methodology see Emery et al., Strategies 

for Humanizing Antibodies, in Antibody Engineering, Carl 
A.K. Borrebaeck, (ed.), Oxford University Press 1995, 
page 159) . 

10 Such a loop-grafted single chain antibody, based on a 
variable domain framework capable of folding in the 
cytosol and subsequent transport to the cell nucleus 
thus created can subsequently be supplied with 
appropriate cloning sites at the gene level, ligated 

15 into fiber A7 RGD encoding genes and rescued into the 
adenovirus genome as described below (Example 7) . 

EXAMPLE 7: 

Rescuing of recombinant fibers into the adenovirus . 
20 genome 

The wild type fiber as encoded in the Ad genome was 
substituted for recombinant fibers by the following 
method developed within the project. In the method the 

25 wild type Ad5 genome in the plasmid pTG3 602 (Emery et 

al . , Supra) was used as receptor for genes encoding the 
recombinant fibers. This plasmid contains the entire 
wild- type Ad5 genome joined to the plasmid backbone by 
Pacl linkers. The entire genome can be recovered as a 

3 0 linear DNA fragment after cleavage with Pacl since Pacl 
sites are absent from the Ad genome. The resulting 
linear Ad DNA can then be transfected to susceptible 
cells to yield virus (Chartier et al . , Supra). From 
this plasmid the Ad genome can also be cleaved as two 

3 5 fragments, one of 27 kb and one of 9 kb, using the 
enzymes Pacl and Spel. The 9 kb fragment has been 
cloned into pBluescript . From the 9 kb fragment, which 
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contains the fiber, a 3 kb Sacl-Kpnl fragment containing 
the fiber gene was further subcloned. The fiber gene 
was deleted between the Ndel site in the tail -portion of 
the fiber and the Muni site which is situated just 
5 down-stream of the fiber-gene and an adapter containing 
an Xhol site and the down- stream sequence was introduced 
between the Ndel and Muni site to obtain a fiber shuttle 
vector. Several recombinant fibers have now been 
ligated between Ndel and Xhol of this shuttle vector and 
10 thereafter rescued into the 9 kb fragment mentioned 

above by homologous recombination in E. coli (Chartier 
et al., Supra). This means that all normal elements 
regulating fiber-expression have been left intact. 

15 Finally, recombinant 9 kb fragments were separately 
joined to the 2 7 kb fragment by cosmid cloning to 
re-create the complete Ad genome. 

The 2 7 kb fragment may also be derived from another 
20 adenovirus genome, such as the Ad-YFG described by He et 
al (He T-C, Zhou S, DaCosta LT, Yu J, Kinzler KW and 
Vogelstein B: A simplified system for generating 
recombinant adenoviruses. Proc Natl Acad Sci, USA, 
95:2509-2514, 1998). If the 27 kb fragment is derived 
!5 from pTG3602 the resulting genome will be WT E1+ whereas 
the 27 kb fragment from Ad-YFG will render El deleted 
viruses needing e.g. low passage 293 for replication. 

The recombinant Ad genome resulting from above 
0 manipulations can finally be obtained as a linear 

fragment by cleaving with Pacl and used to transfect 
permissive cell lines which then yield virus plaques if 
the genome is functional. For these transf ections the 
FuGENE 6 transfect ion agent (Roche) was used. 

5 

Various recombinant fibers were rescued into the Ad 
genome and subsequently transfected into permissive 
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cells. Results are shown below in table 4. Of the fibers 
accounted for, only those containing the WT knob and the 
affibody A7 ZIgG/IgG were capable of rendering 
functional virions. The results are in complete 
5 concordance with those shown in tables 2 and 3. 

Table 4: Rescuing of recombinant fibers into virions 



Fiber Trans fee ted cells Occurrence of virus 

10 plagues 

A7 WT Knob 293 Yes 

A7 EGF A549 No 

A7 C242 Colo 205 No 

A7 G250 A75 No 

15 A7 ZIgG/ZIgG 293/Fc* Yes 



(*) 293/Pc cells are 293 cells stably transfected with 
Fc3(l) from human IgG expressed as a membrane protein. 
To create 293/Fc, cloned Fc3 (1) from human IgG which 

20 reacts with the Z-domain (Jendeberg, L. , Nilsson, P., 
Larsson, A. , Nilsson, B., Uhlen, M. and Nygren, P. -A 
(1997) "Engineering of Fcl and Fc3 from human 
immunoglobulin G to analyse subclass specificity for 
staphylococcal protein A" J. Imm. Methods 201, 25-34.) 

25 was ligated into the vector pDisplay (Invitrogen) in 
frame with the PDGFR transmembrane domain sequence 
present in this vector. The coding sequence containing 
the Fc sequence fused to the PDGFR transmembrane domain 
sequence was then cleaved from the vector using the 

30 restriction enzymes Sfil and Notl and ligated into the 
vector pSecTag (which carries the Zeocin resistance 
gene) cleaved with the same enzymes. The reconstituted 
vector was the transfected into low passage 293 cells 
using FuGENE and the cells were placed under selection 

35 pressure using Zeocin to select for stably transformed 
cells. Clones were isolated and tested for membrane 
expression of membrane-bound Fc3(l) using FITC labeled 
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staphylococcal protein A (Sigma) . One 293 clone which 
homogenously expresses membrane bound Fc was used for 
transfection of the A7 ZIgG/ZIgG containing genome to 
produce virions. 

5 

EXAMPLE 8: 

Assembly of gene constructs encoding recombinant 
adenovirus fibers with an affibody and a cleavable wild 
type knob 

10 

Cleavable fibe'rs containing both a non-native 
polypeptide comprising an external ligand, here an 
affibody, and a C-terminally placed wild type knob were 
constructed with an activated factor X site situated 
15 between the cell binding structures so that the knob can 
be cleaved off to expose the affibody. This permits 
virus production in 293 cells with subsequent infection 
of new target cells as defined by the affibody after 
proteolytic removal of the knob. 

20 

To construct a gene for a cleavable fiber for ligation 
of different ligands the WT fiber was subjected to PCR 
with an upstream primer' (437) introducing Clal, Muni and 
the Factor Xa recognition site before the first seven 

25 triplets of repeat 22 and the downstream primer 149 
which primes at the end of the fiber knob. After 
cloning and restriction with Clal and Xhol the DMA 
fragment was cloned into the A7 fiber construct 
mentioned earlier. The resulting fiber gene contains 

30 from the N-terminus the sequence for the tail, the first 
seven shaft repeats, the NRP trimerisation signal, the 
linker from staphylococcal protein A, a Clal site, a 
Muni site, the Xa cleavage site, repeat 22 and the wild 
type knob. For ligation into this fiber gene the gene 

35 for the affibody ZIgG was supplied with an upstream Clal 
site and a downstream Muni site by PCR using primers 503 
and 505 and ligated into the aforementioned "Xa fiber" 
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gene. The resulting fiber gene contains from the N- 
terminus the sequence for the tail, the first seven 
shaft repeats, the NRP trimerisation signal, the linker 
from staphylococeal protein A, a Clal site, the new 
5 linker, a Muni site, the Xa cleavage site, repeat 22 and 
the wild type knob. 

The fiber gene was rescued into the adenovirus genome 
(see Example 7). The recombinant genome was transfected 
10 into 293 cells and virions were produced and purified on 
CsCl gradients. Purified virions were cleaved with 
activated Factor X (Xa) (Sigma) in 50mN Tris-Cl pH 8,0; 
100 mM NaCl; brnM CaCl 2 for 48 hours at room temperature. 
1 0 of Xa was used for 5|iL of virus suspension. 

15 

Assay for binding of the ZIgG affibody on the virions 
was performed essentially as follows. Virus suspensions 
were blotted onto PVDF membranes (BioRad) . After 
blocking with 3% gelatin in Tris buffered saline, pH 7,4 

20 (TBS) the membranes were incubated with FC lOpg/mL in 1% 
gelatin in TBS for 90 minutes at room temperature, 
washed, incubated with anti-human IgG-HRP (Dakopatts) in 
TBS/1% gelatin for 90 minutes at room temperature, 
washed and developed in 4-Chloro Naphtol reagent 

25 (BioRad) . The results of staining of WT and recombinant 
viruses are shown in Figure 3 along with different 
controls. It is obvious that the recombinant virus binds 
Fc3(l) which binds to WT Z whereas Fc3 which does not 
bind to WT Z (Jendeberg, L., Nilsson, P., Larsson, A. , 

30 Nilsson, B., Uhlen, M. and Nygren, P'.-A. (1997) 

"Engineering of Fcl and Fc3 from human immunoglobulin G 
to analyse subclass specificity for staphylococcal 
protein A" J. Imm. Methods 201, 25-34) fails to bind to 
the recombinant virus. WT virus does not bind either Fc 

35 preparation. 
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Example 9: 

Assay of non-native polypeptides for functional 
conformation and binding specificity 

5 

The candidate polypeptide is expressed as part of a 
viral component protein by means of insertion of the 
corresponding coding sequence into a suitable construct. 
The construct, which comprises an entire viral genome in 

10 which one or more components have been replaced by the 
corresponding /recombinant component gene, is expressed 
in a cell line suitable for propagation of the 
recombinant virus coded for by the recombinant viral 
genome. There are two main possibilities depending on 

15 whether the WT cell-binding function is retained in the 
recombinant genome or not. 

If the WT cell-binding function is retained virus can be 
produced in normal producer lines such as 293 for 

20 adenovirus. The ability of the non-native polypeptide to 
be expressed in a conformation that allows it to bind to 
an extracellular ligand and to be part of a functional 
virion is assayed by binding studies on virions. This 
approach is exemplified by the strategy employed in 

25 Example 8 above where the virus fiber contains both the 
WT knob and a non-native binding ligand and where the 
knob can be removed by proteolytic cleavage. The binding 
function of the non-native polypeptide was demonstrated 
by standard solid phase immunotechnology . Another 

30 strategy to meet the same ends would be to produce 

virions with WT fibers as well as recombinant fibers, 
the latter containing the non-native polypeptide as 
replacement for the native knob. 

35 If the WT cell-binding function has been deleted from 
the viral genome the virions will depend on the 
non-native polypeptide for cell binding. Therefore, the 
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cell line must be able to permit entry of the 
recombinant virus and, if necessary, to supply genetic 
information that may be missing from the recombinant 
virus, e.g. the sequence for El. If the recombinant 

• 5 genome is El deleted, one solution is to use an El 
transfected cell line such as 293 and to stably 
transfect these with a receptor capable of binding the 
non-native polypeptide coded for by the recombinant 
viral genome. The ability of the non-native polypeptide 

10 to be expressed in a conformation that allows it to bind 
to the corresponding receptor structure and form part of 
a functional virion is assayed by screening for plaque 
formation in the cells following transfection of the 
viral genome. This strategy was employed in Example 7 

15 above where 293 cells stably expressing membrane bound 
Fc from human IgG was used to produce virions where the 
WT binding function was replaced with the affibody ZIgG. 
The exact binding properties of the non-native 
polypeptide can further be determined in studies where 

20 the virus is allowed to compete for binding with 
peptides structurally related to the non-native 
polypeptide responsible for cell-binding. 

EXAMPLE 10: 

25 Anti B-aalactosidase single chain Fv fragment 

In a continuation of the experiments described in 
Example 3, a single chain variable chain fragment (VK, 
linker, VH) reactive with (3~galactosidase and capable of 

3 0 being expressed in the cytoplasm (Martineau P and Betton 
J-M: J. Mol. Biol. 292, 921-929 (1999)) was cloned into 
the aforementioned A7 fiber constructs. This single 
chain fragment is [SEQ. ID. NO. : 47) . To permit cloning 
into the aforementioned A7 fiber constructs, the single 

35 chain fragment was cut with Ncol/EcoRl from the plasmid 
pPM163R4 (Martineau P and Betton J-M supra) and ligated 
into a plasmid containing the aforementioned Fiber R7 
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EGF supplied with a Clal-Ncol-EcoRl-Xhol adapter 
allowing for in- frame ligation of the single chain 
fragment . 

5 In a continuation of the experiments described in 
Example 5, expression and cellular localization of 
recombinant fibers containing the single chain variable 
chain fragment (VK, linker, VH) reactive with 
(3-galactosidase were evaluated by immunostaining using 
10 the following additional primary reagent: 
(3-galactosidas'e-biotin (SIGMA, G 5025) . 

In addition to the results shown in Table 2, results 
results obtained in equivalent experiments for the Anti 
15 (3-galactosidase single chain Fv fragment fiber construct 
are shown Table 5: 

Table 5: Expression and functional ligand binding in 
recombinant adenovirus fibers targeted for secretion in 
20 COS cells 



Fiber 


4D2+ aMIg 


2A6+ aMIg 


P-galactosiclase 
+ StrAv-FITC 


R7 

a-g-galactosidase 


+ 


+ 


+ 



25 

In addition to the results shown in Table 3, results 
obtained in equivalent experiments for the Anti 
(3-galactosidase single chain Fv fragment fiber construct 
are shown Table 6: 
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Table 6: Expression and functionality of ligand binding 
in the nucleus of COS cells in native and selected 
recombinant fibers after targeting for expression in the 
cytosol 

5 



Fiber 


4D2+ aMIg 


2A6+ aMIg 


(3-galactosidase 
+ StrAv-FITC 


R7 

a- g-galactosidase 


+ 


+ 


+ 



10 

EXAMPLE 11: 

Phenotypic analysis of fibe r proteins. 

15 The following is a further example of modified viruses 
of the present invention wherein complex polypeptide 
ligands have been incorporated into a modified 
adenoviral fibre protein. The example demonstrates the 
importance for generation of a retargeted viable and 

20 functional Ad vector of two features: (i) the fibre 

structure modifications should still allow for efficient 
attachment and cellular entry of the virus > and (ii) 
ligands inserted into the fibre should be capable of 
correct folding in the mammalian cell cytoplasm. 

25 Solubility is conveniently used to assess correct 
folding which is typically linked to an absence of 
disulphide bonds. Preferably the non-native 
polypeptides described herein will have no more than 2 
disulphide bonds, typically no more than 1 and most 

30 preferably no disulphide bonds. 

Materials and Methods 

Cells. HEK-293 cells were obtained from Microbix Inc. 
35 (Toronto, Ontario, Canada) . Cos7, A431 and Colo205 cells 
were purchased from the American Type Culture Collection 
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(ATCC; Manassas, Virginia) and A75 and G43 cells were 
obtained from Reinder Bolhuis (Daniel Den Hoed Cancer 
Center, Rotterdam, The Netherlands) . All cells were 
maintained at 37 °C and 5% C0 2 in Iscove's medium (Gibco 
5 BRL) , supplemented with 10% fetal bovine serum 

( Sigma -Aldrich) and 50 mg/ml Gentamicin (Gibco BRL) . 
Spodoptera frugiperda (Sf9) cells (ATCC) were cultured 
at 28 °C in TC 100 medium (Gibco BRL) with the same 
supplements as above. 

10 

Antibodies. Monoclonal antibody (mAb) against epidermal 
growth factor (EGP) was purchased from Cambio Ltd. 
(Cambridge, UK) . Antibodies directed against the Va and 
Vp domains of single chain T-cell receptor (scTCR) , and 

15 anti-idiotypic antibodies directed against mAb G250 
(NUH31 and NUH84) were kindly supplied by Reinder 
Bolhuis. MAb against fiber tail (4D2.5), and mAb against 
fiber trimer (2A6.36) were obtained from Jeff Engler 
(University of Alabama at Birmingham, AL) . CAR-blocking, 

20 fiber knob directed mAb 1D6 . 14 was supplied by Buck 

Rogers (UAB at Birmingham, AL) . The monoclonal antibody 
RL2, which is specific for O-linked GlcNAc residues, was 
obtained from Larry Gerace via Jeff Engler. Biotinylated 
anti-idiotypic antibodies directed against mAb C242 

25 (Idl, Idl3 and Id20) were produced from the original 
hybridomas. Horse radish peroxidase (HRP) -labeled 
streptavidin, fluorescein isothiocyanate (FITC) -labeled 
rabbit anti-mouse immunoglobulin G and streptavidin-FITC 
were purchased from DAKO (Glostrup, DK) . 

30 

Generation of recombinant knobless fibers and 
nomenclature. Recombinant fiber genes were constructed 
using methods based on ligation, PCR, and splicing by 
overlap extension (SOE) . Gene sequences generated by PCR 
35 were sequenced before subcloning. The gene encoding the 
Ad5 WT fiber was obtained from pAB26 (Microbix, Toronto, 
Canada) by PCR using the forward primer (SEQ ID NO: 48) 
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5'-CTC GGA TCC GAT GAA GCG CGC AAG ACC GTC TGA A- 3 1 and 
reverse primer (SEQ ID NO: 49) 5 1 -TTC CTC GAG TTA TTC 
TTG GGC AAT GTA TGA- 3 1 introducing an upstream Bam H I 
and a downstream Xho I site, respectively. In 
5 recombinant fibers, the knob domain was deleted and 

replaced by a 36 amino acid (aa) extrinsic trimerisation 
motif derived from the neck region peptide (NRP) of 
human lung surfactant protein D. The NRP sequence 
( PD VAS LRQQ VAE L QGQ VQHLQ AAF S Q YKKVELF PNG ) (SEQ ID NO: 2), 
10 followed by a linker sequence from Staphylococcus 

protein A (Staph-A linker: AKKLNDAQAPKSD) , was ligated 
to the C-terminal end of fiber shaft of different 
lengths, 1, 7 or 22 repeats. 

15 The resulting constructs were named Rl, R7 and R22, 
respectively. Re-targeting ligands were added to the 
C-terminal end of the Staph-A linker. For convenient 
cloning of various ligands, Cla I and Xho I sites were 
introduced after the linker sequence. All ligands 

20 mentioned below were provided with these restriction 
sites, and the name of the ligand was indicated after 
the number of shaft repeats in the fiber name. E.g., 
Rl-RGD, R7-EGF, etc (Fig. 4) . R7-knob referred to a 
truncated fiber shaft (repeats 1 to 7) , carrying NRP, 

25 the Staph-A linker and the natural knob domain, 

including the last shaft repeat and the shaft -knob 
junction (Fig. 4) . For two R7-knob fiber constructs, an 
extra linker was inserted into the Nhe I site located on 
the N- terminal side of NRP. These linkers were derived 

30 from the Ad5 WT fiber shaft repeat 17 (L17 : 

TTTACAGCTTCAAACAATTCCAAAAAGCTTGAG) and fiber shaft 
repeat 22 (L22 GGAAACAAAAATAATGATAAGCTAACTTTGTGTGACC) 
were named as R7-L17-knob and R7-L22-knob, respectively. 
The schematic representation of the fibers is shown in 

35 Fig 4. 

Targeting ligands. Polypeptide ligands. Double stranded 
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DNA fragments containing the RGD or ACDCRGDCFCG 
(abbreviated RGD4C) motifs were synthesized as 
complementary oligonucleotides with single-stranded 
terminal adapters, and annealed together. The fragment 
5 containing the shaft repeat 22, the shaft-knob junction 
and the entire knob was obtained by PCR from the WT Ad5 
fiber gene . The DNA sequence for human EGF was 
synthesized, cloned and verified by sequencing. The 
cloned single chain T-cell receptor (which contains 
10 disulfide bonds) with specificity towards Magel/HLA Al 
was obtained from Reinder Bolhuis. The scTCR used had 
the format Va-Linker-V(3C(3 . 

Monoclonal antibodies. Two mAb single chain fragments 
15 (scFv) were used. The first scFv consisted of the 

variable kappa chain (or VK) , a spacing linker, the 
variable heavy chain (or VH) , the joining sequence (JS) 
and the constant heavy chain domain 2 (or CH2) of mAb 
G250. MAb G250 has been shown to have a high specificity 
2 0 towards a surface antigen of human renal carcinoma 
cells . 

Affibodies. The second scFv contained the same basic 
structure VK-linker- JS-VH, and derived from mAb C242, 

25 which reacts with colorectal and pancreatic carcinomas. 

However, to obtain a better selectivity for the designed 
target cells, a library of C242 scFv-derived affibodies 
was generated by SOE, using cDNA from the mAb-producing 
hybridoma cells as the original template. A randomized 

30 18 amino acid peptide ligand (SNN) 18 was inserted between 
the VK and VH domains, where S was dA, dG or dC and N 
was dA, dG, dC or dT. Affibodies were selected by phage 
display on colorectal carcinoma cells Colo205, and the 
sequence PPDFVPPAASFPDHSPRG was identified for the 

35 peptide ligand. The C242 scFv VK- (PPDFVPPAASFPDHSPRG) -VH 
was then selected for further work based on its antigen 
binding ability. 
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Cellular expression and localization of fiber and ligand 
reactivity. The genes encoding recombinant fibers were 
cloned into the vectors pcDNA3 . 1 and pSecTag2 
(Invitrogen BV, Groningen, Germany) for intracellular 
5 expression and extracellular release, respectively. 
Vectors were transfected into Cos7 cells using 
Lipofectamin (Life Technologies Inc., Gaithersburg, MD, 
USA) following the manufacturers' instructions. 
Expression, nuclear transport and functional expression 

10 of the fibers with the new cell binding ligands were 
evaluated by immunostaining as follows. Cells were 
centrifuged onto microscope slides in a Shandon 
Cytospin2 cytocentrif uge and air-dried over night at 
room temperature. The preparations were fixed in 3% 

15 paraformaldehyde in PBS and permeabilized with 0.1% 

Triton-XlOO in PBS. After washing in PBS, preparations 
were incubated with primary antibodies (anti-fiber mAb, 
or anti-ligand mAb) for 30 min at 37°C in a humid 
chamber, washed again in PBS and incubated with 

20 secondary antibodies labeled with FITC for 30 min at 
37 °C in a humid chamber. After washing in PBS, 
preparations were mounted in PBS with 50% glycerol and 
viewed in a Zeiss Axioskop microscope equipped with 
appropriate light source and filters for FITC. 

25 

Phenotypic analysis of fiber proteins. Recombinant fiber 
proteins were expressed in insect cells infected with 
recombinant baculoviruses, and analysed according to 
four criteria: (i) solubility, (ii) trimerization, (iii) 
3 0 glycosylation and (iv) assembly with recombinant penton 
base in vivo to form penton capsomers. 

(i) For immunological quantification of soluble versus 
insoluble recombinant fiber fractions, Sf9 cells were 
35 lysed in hypotonic buffer (10 mM Tris-HCl buffer, pH 
7.5) at 0°C, and the cell lysates were adjusted to 
isotonic conditions (150 mM NaCl in 10 mM Tris-HCl, pH 
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7.5) and subjected to centrif ugation at 15,000 x g for 
10 rain. Supernatants and pellets were then analyzed by 
conventional SDS-PAGE and immunoblotting, using 
anti-tail 4D2 . 5 mAb as primary antibody, and 
5 [ 35 S] SRL-labeled anti-mouse IgG secondary antibody 
(Amersham Pharmacia Biotech; 100 //Ci/ml; 5 fiCi per 
blot) . (ii) Oligomerization status of fiber was assayed 
by means of non- denaturing SDS-PAGE (referred to as 
NDS-PAGE) and conventional, denaturing SDS-PAGE. 

10 NDS-PAGE differed from SDS-PAGE in that the samples were 
not denatured by boiling in SDS sample buffer prior to 
electrophoresis, (iii) Glycosylation of recombinant 
fibers was assessed both by immunoreact ion on blots 
using the monoclonal antibody RL2 and chemical detection 

15 using the DIG Glycan Detection Kit (Roche) . (iv) 
Assembly of fiber with penton base was assayed by 
co-infecting the same Sf9 cells with two recombinant 
AcNPV, one expressing the penton base, the other 
expressing the fiber protein. 

20 

The presence of penton capsomer was detected in cell 
lysates after 40 h post-inf ection, and analysed by PAGE 
in native conditions, at low voltage overnight with 
cooling. Immunological quantification of native penton, 

25 penton base and fiber proteins was performed as above, 
using the corresponding primary antibody (ant i -penton 
base or anti-fiber), followed by [ 35 S] SRL-labeled 
ant i -mouse or anti- rabbit whole IgG secondary antibody. 
Blots were exposed to radiographic films (Hyperfilm 

30 beta-max, Amersham Pharmacia Biotech) , and 

autoradiograms were scanned at 610 nm, using an 
automatic densitometer (REP-EDC, Helena Laboratories, 
Beaumont, TX) . Alternatively, protein bands were excised 
from blots and radioactivity measured in a scintillation 

35 counter (Beckman LS-6500) . 



Rescue of recombinant modified fibers into virions. 
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Recombinant fibers were rescued into the Ad5 genome as 
described above. Briefly, a 9 kb-fragment from the Ad5 
genome, spanning map units (mu) 75.4 to 100 and 
containing the WT fiber gene, was generated by Spe I and 
5 Pac I digestion of pTG3602. The plasmid pTG3602 

contained the entire WT Ad5 genome bounded by two Pac I 
sites. The 75.4-100 mu- fragment was then cloned into 
pBluescript II SK(-) (Stratagene) , after addition of a 
Pac I site in its MCS , generating the plasmid pGAG9 . 

10 From pGAG9, a Sac I-Kpn I 3 kb- fragment was then 

subcloned into pBluescript II SK(-) . A large deletion 
downstream of the Nde I site (located within the tail 
domain of Ad5 fiber) was created in the 3kb-f ragment , 
and the deleted sequence replaced by a Xho I 

15 site-containing linker. This generated the plasmid 

pGAG3, which was the receiving plasmid for all our fiber 
gene constructs, inserted between Nde I and Xho I sites. 
pGAG3- inserted recombinant fibers were re -introduced 
into pGAG9 digested with Nhe I, using homologous 

20 recombination in E. coli BJ5183. The resulting 

recombinant pGAG9 was then excised from the vector with 
Spe I and Pac I, and joined to the isolated 27 
kb-fragment (0-75.5 mu) representing the left-hand 
segment of the Ad5 genome by Cosmid cloning (SuperCos 1 

25 Cosmid Vector Kit and Gigapack III Gold Packaging 
Extract, Stratagene) . The presence of the correct 
recombinant fiber in the cosmid clones were verified by 
PCR and restriction analysis using Spe I and Hind III. 

30 For virus production, recombinant cosmid genomes were 
isolated after restriction with Pac I, and transfected 
into cells expressing the receptors corresponding to the 
fiber- inserted ligand. In standard transfection 
reactions, 2 fig DNA and 3 fil FuGENE (Roche) were used 

35 per 3 5 -mm well, according to the manufacturer's 

protocol. Ad5 genomes with fiber gene containing either 
RGD motifs or the WT fiber knob as ligands were 
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transfected into 293 cells. For the other liganded 
fibers, Ad5 genomes with the EGF ligand were transfected 
into A549 cells, scFv-C242 into Colo 205 cells, 
scFv-G250 into A75 cells and scTCR specific for 
5 Magel/HLA Al into G43 cells. At least three different 
transf ections were simultaneously performed, each one 
using a 6 -well plate. The occurrence of plaques was 
determined by microscopic observation of the transfected 
cell cultures. Verification of recombinant fiber 
10 sequence was made by PCR with specific primers for each 
fiber construct. 

Characterization of recombinant fiber and Ad5 virions. 

15 Cellular expression. To evaluate the level of fiber 

expression, 293 cells were infected with 10 pfu/cell of 
WT, R7-knob and R7-RGD virus. Cells were harvested and 
freeze- thawed four times analyzed by SDS-PAGE and 
western blotting. The blots were reacted with 4D2 . 5 and 

20 revealed with HRP-labeled anti-mouse IgG (DAKO) . 

Fiber content of Ad5 virions. The fiber copy number of 
virions was determined, after CsCl purification of Ad5 
virions, by SDS-PAGE and western blot analysis as above. 

2 5 The virus loads in acryl amide gels were normalized for 

equal amounts of infectious particles, determined by 
virus titration on 293 cells (expressed as plaque 
forming units/ml; PFU/fml), and for equal amounts of 
physical particles (PP) , determined by protein assay 
30 (BioRad) . The number of PP was determined by SDS-PAGE 
analysis and Coomassie blue staining of Ad5 
recombinants, co-electrophoresed with a range of 
standard bovine serum albumin (BSA) samples (2x 
crystallized, BioRad) . Protein content of hexon bands 

3 5 was evaluated by comparison with BSA standard bands, by 

scanning in an automatic densitometer (REP-EDC, Helena 
Laboratories, Beaumont, TX) . The number of Ad5 PP in 
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samples was calculated from the mass of 2.91 x 10e-16 g 
per single virion, i.e. 2.91 mg per 10e+13 virions. The 
infectivity index represented the ratio of infectious to 
physical particles . 

5 

Growth rate of recombinant Ad5 . Growth-rate was measured 
by the production of plaques in 293 cells. In standard 
assays, virus in PBS was adsorbed to cell monolayers (4 
x 10 4 cells per sample) at 37°C for 1 h. The cells were 

10 rinsed once and further incubated in Iscove's medium 
supplemented with 10 % FCS and 50 mg/ml gentamycin at 
37 °C. Cells were harvested at 24, 48 and 72 h after 
infection (pi), centrifuged, dissolved in 0.2 ml PBS and 
f reeze-thawed four times. The supernatants were titered 

15 on 293 cells, and the titer expressed as plaque forming 
units per ml . The total Ad protein content was measured 
by the IDE I A™ Adenovirus kit (DAKO) . 

Gene transduction efficiency. Monolayers of 293 cells in 
20 24 -well plates were infected as described above with 10 
pfu/cell of the recombinant viruses. Cells were 
harvested at 24 h pi, washed with ice-cold PBS followed 
by fixation in 0 . 5 % glutaraldehyde for 15 min. After 
three washes in PBS, the cells were analysed for 
25 transgene GFP expression, using the FL1 emission channel 
in a FACScan cytometer (Becton-Dickinson, San Jose, CA) . 

Assay for fiber knob in viral capsids. The presence or 
absence of accessible knob in virions was assessed by 

30 ELISA. Purified virions were diluted with 50 mM 
carbonate-bicarbonate buffer pH 9.6, to a final 
concentration of 5 x 10 5 PFU/ml . Aliquots of 100 ml were 
adsorbed onto ELISA plates overnight at 4°C. Material 
adsorbed in wells was fixed with 0.5% glutaraldehyde, 

35 after which the wells were washed with PBS containing 

0.1% Tween-20 (washing buffer), and blocked for 2 h with 
200 ml PBS containing 1% BSA. Wells were then washed 
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three times in washing buffer and incubated for 1 h at 
room temperature with 100 ml of biotinylated anti-knob 
mAb 1D6.14 at a concentration of 1 mg/ml . Bound antibody 
was detected using HRP-streptavidin (DAKO) at a dilution 
5 of 1:2,000, for 1 h at room temperature. Colour 

development was obtained with TMB substrate (CanAg 
Diagnostics, Goteborg, Sweden), and stopped with 0.12 M 
HC1 for 10 min. Plates were read in a microtiter plate 
reader set at 450 nm. 

10 

Rationale for the construction of recombinant fibers. 

Knobless fibers with the extrinsic trimerisation motif 
(NRP) from human lung surfactant protein D were 

15 constructed with different numbers of shaft repeats and 
different ligand structures. The three constitutive 
elements of these fiber constructs were considered from 
a structural and functional point of view: (i) the fiber 
scaffold, (ii) the flexible linker and (iii) the cell 

2 0 ligands. 

Fiber scaffold. The fibers were named according to the 
number of shaft repeats and the ligand present. As an 
example, R7-EGF contained the fiber tail, the N-terminal 

25 7 shaft repeats, the NRP motif and the C-terminal EGF 
peptide as the cellular ligand. Rl-RGD fiber had only 
one shaft repeat (the first one) and RGD as the ligand, 
etc. Of three possible shaft lengths, short (1 repeat), 
medium (7 repeats) and long (22 repeats) repeats, the 

30 intermediate size fiber with 7 repeats (R7) was chosen 
as the building scaffold for further constructions and 
studies with different ligands. For reasons discussed 
below, the rationale for chosing R7 was based on its 
high solubility as recombinant protein, the maintenance 

35 of most of the fiber biological functions, and on the 
yields of Ad5/FibR7 virus progeny. The study on the 
comparative advantages of fiber shafts of different 
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lengths was performed with Rl-RGD, R7-RGD and R22-RGD 
fibers, three constructs carrying the same cellular 
ligand RGD . 

5 Linker, To evaluate the possible advantage of an 
additional linker in the fiber shaft domain, two 
R7-NRP-knob fibers were constructed with an extra 
peptide linker inserted between the shaft domain and the 
NRP motif. Two virus-derived linkers were thus tested, 
■10 the Ad5 shaft repeat 17 (L17) and the Ad5 shaft repeat 
22 (L22) . The resulting recombinant fibers were named 
R7-L17-knob and R7~L22-knob, respectively. 

Cellular ligands . Several ligands were tested and 
15 compared for functionality. These ligands were designed 
to re-targeted Ad5 vectors to cell surface molecules of 
broad distribution, like integrins or HLA molecules, or 
to less ubiquitous molecules, like malignant cell 
specific determinants. They varied in size and 
20 complexity from a simple tripeptide motif, like RGD, to 
more elaborated structures, like scTR or scFv, which 
consist of several polypeptide domains with requirement 
for proper folding. 

25 Expression of fibers and ligands in mammalian cells. 

Recombinant fibers were first transiently expressed in 
mammalian cells, using plasmid-transf ected Cos7 cells. 
The recombinant fibers were expressed from two different 
vectors, one designed for intracellular expression of 

3 0 recombinant proteins, the other for their extracellular 
release via the secretory pathway. Cells were assayed 
for the level of fiber expression, cellular localization 
and functionality of their ligands. All the different 
fibers tested exhibited apparent appropriate ligand 

3 5 binding when recovered as secreted proteins. For fibers 
synthesized from the intracellular expression vector, 
nuclear localization was observed in all cases, except 
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for R7-L22-knob. This implied that the cytoplasmic 
transit and the traverse of the nuclear pore occurred as 
efficiently for the modified fibers as for WT Ad5 fiber. 

5 However, none of the extrinsic ligands fused to fibers 
showed the expected binding activity. Even though the 
intracellular^ expressed R7-scTCR fiber could be 
detected within the nucleus, and was stained with an 
anti-Va mAb recognizing an epitope independent of the Va 

10 domain conformation, no staining could be detected with 
another, conformation-dependent mAb, directed against 
the Vb domain. In order to test the possible detrimental 
effect of the fiber domain on the reactivity of 
intracellular ligands, our ligands were also expressed 

15 as separate constructs from the vector pcDNA. There was 
no detectable binding of the nonfused intracellular 
ligands to their specific mAbs, indicating that the 
ligands behaved similarly when free or fused to the 
fiber. 

20 

There was therefore a profound difference in binding 
ability between fiber-ligand fusions designed for 
secretion and designed for cytoplasmic expression. This 
suggested that fiber-ligand fusion proteins designed to 

25 follow the secretory pathway underwent a proper folding 
and were in the correct conformation. This did not imply 
that all intracellular fiber-ligand fusion proteins 
would fold incorrectly, since the cellular environment 
also seemed to play a major role in this process. A 

3 0 significant difference could be detected between the 
cytosol of Cos7 and Sf9 cells, in terms of folding 
pattern and reactivity of fiber-ligand fusion proteins. 
This was the case for the fiber knob domain of 
R7-L22-knob, the EGF ligand of R7-EGF and the scFV 

35 ligand of R7-G250, which showed some reactivity in Sf9 
cells but not in Cos7 cells. 
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Phenotype of recombinant fibers expressed in insect 
cells. To further analyze the properties of the liganded 
fibers, our different constructs were expressed as 
recombinant proteins in the baculovirus- insect cells 
5 system, and assayed for protein solubility, 

trimerisation, glycosylation and formation of penton by 
assembly with penton base in vivo in Sf9 cells. 

Protein solubility and conf ormation. The solubility of 

10 recombinant proteins is usually considered as a good 

indicator of their proper folding. We therefore tested 
all our recombinant fiber proteins for their total 
expression in insect cells, and determined the 
proportion recovered in the soluble fraction of the cell 

15 lysates. All the fiber recombinants were highly 

expressed in Sf9 cells, at levels similar to WT Ad5 
fiber, although their degree of solubility varied 
significantly from one to another. We estimated that 
fibers had a WT-like solubility and thus a proper 

2 0 folding when the soluble fraction contained more than 50 
% of the total fiber expressed. A fiber could be 
considered as misfolded when its soluble fraction 
represented less than 20 % of the total. According to 
these criteria, WT, R7-knob, R7-L17-knob, R7-L22 -knob, 

25 Rl-RGD, R7-RGD and R7-RGD4C fibers were mainly recovered 
in the soluble fraction (60-95 % solubility) . In 
contrast, only 22 % of R22-RGD, and less than 15 % of 
R7-EGF, R7-C242 and R7-scTCR fiber was found to be 
soluble, confirming their incorrect folding suggested by 

30 the absence of reactivity with their respective mAbs . 
The least soluble fiber constructs was R7-G250, which 
was recovered at 95 % in the insoluble fraction (see 
Fig. 5 and Table 7 below) . 



35 
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Table 7. Phenotypic characterization of recombinant 
fibers expressed in baculovirus infected Sf9 cells (a) 





Trimerization 


(b) 






Fiber 


Glycosylation 


NDS-PAGE 


IF 


Assembly 
(c) with 
penton 
base 


Solubility 
(d) (%) 


WT 






+ 




60 


R7-knob 


+ 




+ 




72 


R7-L17-knob 


+ 




+ 




70 


R7-L22-knob 


+ 


+ 


+ 




75 


Rl-RGD 










89 


R7 -RGD 






+ 


+ 


94 


R2 2-RGD 






+ 




22 


R7-RGD4C 


+ 


+ 


+ 


+ 


78 


R7-EGF 




ND(e) 


+ 


ND 


13 


R7-C242 




ND 




ND 


11 


R7-G250 




ND 




ND 


5 


R7 -scTCR 




ND 




ND 


12 



20 

(a) Baculovirus- infected Sf9 cells were harvested at 48 
h after infection and recombinant fibers assayed for 
different biological functions, solubility, O-GldsTAc 
glycosylation, trimerisation, and penton capsomer 

25 formation. 

(b) Trimerisation status of fibers was determined in 
vitro by electrophoresis of Sf9 cell lysates in SDS-gel 
without heat denaturation (NDS-PAGE) , and in situ, by 
immunofluorescence staining of fixed cells, using 

30 anti-trimer mAb (2A6.36). 

(c) The capacity of fibers to assemble with penton base 
to form penton capsomer was assayed by co- infection of 
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the same Sf9 cells by two different recombinant 
baculoviruses, one expressing Ad fiber, the other Ad 
penton base. Sf9 cell lysates were analysed at 48 h 
after co- infection for the occurrence of penton (base + 
5 fiber) , by electrophoresis of native proteins in 

nondenaturing 6 % acrylamide gels (Karayan et al., 1994) 

(d) Solubility was assayed in single 

baculovirus- infected Sf9 cells. Cells harvested at 48 h 
after infection were lysed in isotonic buffer, and cell 

10 lysates divided in two aliquots. One aliquot was 

centrifuged at 10,000 x g for 10 min and supernatant was 
kept, whereas the other was analysed as whole cell 
lysate. Both aliquots were denatured at 10 0°C in 
SDS- sample buffer and electrophoresed in denaturing 

15 SDS-gel, and blotted. Blots were reacted with 

anti- fiber-tail mAb 4D2.5, and radiolabeled secondary 
antibody, as shown in Fig. 5. Immunoblots were 
quantitated by autoradiogram scanning and the results 
were expressed as the percentage of soluble versus total 

20 fiber content. Average of three determinations; SD was 
within 15 % of the reported value for the mean. 

(e) ND, not detectable. 

Trimerisation. The ability of the different fibers to 
25 self -assemble into trimers were electrophoretically 
determined in vitro by NDS-PAGE analysis of cell 
lysates, and immunologically in situ by 

immunofluorescence staining of fixed cells. According to 
their electrophoretic patterns, R7-EGF, R7-C242, R7-G250 

3 0 and R7-scTCR were incapable of forming homotrimers, 
whereas R7-knob, R7-L17-knob R7-L22-knob, R7 - RGD , 
R22-RGD and R7-RGD4C trimerized at WT levels (Table 7) . 
However, immunofluorescence staining of cells using the 
anti-trimer mAb 2A6.36 showed that all except R7-C242 

35 and R7-G2 50 formed trimers (Table 7) . The apparent 

discrepancy likely resided in the method of detection, 
or in the possibility that R7-EGF, R7-C242, R7-G250*and 



WO 02/08263 



PCT/GB01/03252 



- 81 - 

R7-scTCR fiber trimers became unstable in vitro, or 
both. Whatever the reason, if fiber trimers formed with 
a very low efficiency within the cells, 
immunofluorescence staining was, in this case, more 
5 sensitive and more appropriate for trimer detection. 

Glycosylation. Likewise, the glycosylation of fibers was 
analysed by western blotting using RL2, a mAb specific 
for peptide -linked O-GlcNAc residues. The fiber 
10 constructs which reacted with RL2 were, besides the WT, 
R7-L17-knob, R7-L22 -knob, R7-knob and R7-RGD4C (Table 
7) . 

Assembly with penton base. When Sf9 cells were infected 
15 with two different baculoviruses , one expressing the Ad 
fiber, the other the penton base, the two proteins are 
capable of interacting intracellularly to form penton 
capsomers. When assayed for this property, the fiber 
proteins R7-EGF, R7-C242, R7-G250 and R7-scTCR had lost 
20 their capacity to assemble with penton base, whereas the 
other eight recombinant fibers had conserved their 
assembly function (Table 7) . 

Rescue of recombinant fiber genes into the viral genome 
25 and viability of the viruses. Our recombinant fiber 
genes were reinserted into the Ad5 viral genome in 
replacement of the WT fiber gene. Each recombinant viral 
genome was then introduced by transfection into the 
corresponding cell line which express the proper 
30 receptor for the recombinant fiber. The rescue of 
viable recombinant viruses was assessed by plaque 
development. Plagues were observed for the Ad5 genomes 
harboring the WT fiber, and recombinant fibers R7-knob, 
R7-L17-knob, Rl-RGD, R7-RGD and R7-RGD4C . 

35 

Fiber content, infectivity and growth rate of 
recombinant viruses. The recombinant R7-RGD Ad carrying 
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seven shaft repeats had been selected as the best choice 
in terms of fiber trimerisation, stability, assembly 
with penton base and ligand binding. However, this 
knobless Ad was less efficient in virus assembly and 
5 production, compared to WT virus. In this example, the 
characteristics of the Ad-R7-knob, which carried the 
knob domain, were compared to those of WT Ad and 
knobless Ad-R7-RGD. "We first assayed for the production 
of fibers in 293 cells infected with each of the three 
10 Ad at different times pi. The production of fiber 

proteins from R7-knob Ad was comparable to that of the 
WT Ad, whereas the level of fiber protein from the 
R7-RGD Ad was significantly lower. 

15 We then compared the fiber content per virion of the 
R7-knob Ad with that of the WT Ad. The two viruses 
exhibited similar amounts of fiber content when 
normalised to the number of physical particles (PP) but 
there were more fiber protein in the R7-knob Ad when the 

20 two virus samples were normalised to the number of 

infectious particles (PFU) . This indicated that R7-knob 
Ad was less infectious than the WT. This was confirmed 
by the infectivity index (PFU : PP ratio) , which was 
calculated from the infectious titer (expressed as 

25 PFU/ml, as determined by titration on 293 cells), and 

from the concentration of PP (determined by biochemical 
methods). The infectivity index of the Ad-R7-knob (1:100 
to 1: 2 00) was 2- to 8- fold lower than that of WT Ad5 
(1:25-1: 50). The infectivity index of Ad-R7-RGD (1 : 

30 200 - 1 : 500) was in a similar range as that of 
Ad-R7-knob. 

The growth rate of the Ad-R7-knob was compared to that 
of WT virus. Aliquots of cell samples were infected at 
35 the same MOI , and infectious virus progeny, determined 

by plaque assays on 293 cells. The growth curves for the 
three viruses were similar, but the production of 
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infectious Ad was inferior to the WT for both Ad- 
R7-knob and Ad-R7 -RGD . Virus uptake was also examined 
for Ad-R7~knob, and estimated from its ability to 
transduce the GFP reporter gene into 2 93 cells, as 
5 compared to Ad5 -GFP carrying WT fibers (Ad-WTFib) . The 

transducing capacity per PP was found to be 6-fold lower 
for the R7-knob Ad than for Ad-WTFib . 

Receptor-binding capacity of recombinant virions. 

10 Ad-FibR7-knob was assayed for its receptor binding 
capacity, as compared to WT virus. Both WT Ad5 and 
Ad-FibR7-knob virions were found to react with anti-knob 
mAb in ELISA, suggesting that the knob epitopes were 
accessible to mAb on both types of virions. However, the 

15 knob immunoreactivity was higher for WT Ad than for 
Ad-FibR7-knob. This suggested that the cell binding 
determinants of virions were more efficient or/and had a 
higher affinity when the knob was carried by a 
22-shaf t-repeat fiber, as in WT Ad5 , than by a shorter, 

20 7-repeat fiber. 

These results show that the ligands which were found to 
be improperly folded within the cytosol and the nucleus 
all had a conformation that was highly dependent on the 
25 formation of disulfide bridges. Thus antibodies or 

fragments thereof should be used which lack disulfide 
bridges while retaining native or modified epitope 
binding. 

3 0 The number of repeats in the shaft portion should 

preferably be at least about 7, e.g. 6-12. The presence 
of a wild type knob (e.g. as one of two types of fibre 
proteins wherein the second is modified for re- 
targeting) may be beneficial for generation of 

35 recombinant viable Ad virons. 
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Sequence Listing 

[SEQ. ID. NO. : 1] - Adenovirus 5 Fiber Protein encodi 
sequence . 

5 

Sequence Range: 1 to 17 4 6 



10 

atgaagcgcg 
10 tacttcgcgc 

60 

tgacacggaa 
actgtgcctt 

15 

110 

tatcccccaa 
atagggggtt 



20 

caagaccgtc 
gttctggcag 

70 

accggtcctc 
tggccaggag 

120 

tgggtttcaa 
acccaaagtt 



30 

tgaagatacc 
acttctatgg 

80 

caactgtgcc 
gttgacacgg 

130 

gagagtcccc 
ctctcagggg 



40 

ttcaaccccg 
aagttggggc 

90 

ttttcttact 
aaaagaatga 

140 

ctggggtact 
gaccccatga 



50 

tgtatccata 
acataggtat 

100 

cctccctttg 
ggagggaaac 

150 

ctctttgcgc 
gagaaacgcg 



20 160 170 180 190 200 

ctatccgaac ctctagttac ctccaatggc atgcttgcgc tcaaaatggg 
gataggcttg gagatcaatg gaggttaccg tacgaacgcg agttttaccc 

210 220 230 240 250 

25 caacggcctc tctctggacg aggccggcaa ccttacctcc caaaatgtaa 

gttgccggag agagacctgc tccggccgtt ggaatggagg gttttacatt 

260 270 280 290 300 

ccactgtgag cccacctctc aaaaaaacca agtcaaacat aaacctggaa 
30 ggtgacactc gggtggagag tttttttggt tcagtttgta tttggacctt 

310 320 330 340 350 

atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc 

tatagacgtg gggagtgtca atggagtctt cgggattgac accgacggcg 

35 



360 370 380 390 400 

cgcacctcta atggtcgcgg gcaacacact caccatgcaa tcacaggccc 
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gcgtggagat taccagcgcc cgttgtgtga gtggtacgtt agtgtccggg 

410 420 430 440 450 

cgctaaccgt gcacgactcc aaacttagca ttgccaccca aggacccctc 

gcgattggca cgtgctgagg tttgaatcgt aacggtgggt tcctggggag 

460 470 480 490 500 

acagtgtcag aaggaaagct agccctgcaa acatcaggcc ccctcaccac 

tgtcacagtc ttcctttcga tcgggacgtt tgtagtccgg gggagtggtg 

• 510 520 530 540 550 

caccgatagc agtaccctta ctatcactgc ctcaccccct ctaactactg 

gtggctatcg tcatgggaat gatagtgacg gagtggggga gattgatgac 



15 560 570 580 590 600 

ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 

ggtgaccatc gaacccgtaa ctgaactttc tcgggtaaat atgtgtttta 

610 620 630 640 650 

2 0 ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct 

ccttttgatc ctgatttcat gccccgagga aacgtacatt gtctgctgga 

660 670 680 690 700 

aaacactttg accgtagcaa ctggtccagg tgtgactatt aataatactt 

25 tttgtgaaac tggcatcgtt gaccaggtcc acactgataa ttattatgaa 

710 720 730 740 750 

ccttgcaaac taaagttact ggagccttgg gttttgattc acaaggcaat 

ggaacgtttg atttcaatga cctcggaacc caaaactaag tgttccgtta 



30 



760 770 780 790 800 

atgcaactta atgtagcagg aggactaagg attgattctc aaaacagacg 
tacgttgaat tacatcgtcc tcctgattcc taactaagag ttttgtctgc 



35 



810 820 830 840 850 

ccttatactt gatgttagtt atccgtttga tgctcaaaac caactaaatc 
ggaatatgaa ctacaatcaa taggcaaact acgagttttg gttgatttag 
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860 870 880 890 900 

taagactagg acagggccct ctttttataa actcagccca caacttggat 

attctgatcc tgtcccggga gaaaaatatt tgagtcgggt gttgaaccta 

5 910 920 930 940 950 

attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa 

taattgatgt tgtttccgga aatgaacaaa tgtcgaagtt tgttaaggtt 



960 970 980 990 1000 

10 aaagcttgag gttaacctaa gcactgccaa ggggttgatg tttgacgcta 

tttcgaactc caattggatt cgtgacggtt ccccaactac aaactgcgat 

1010 1020 1030 1040 1050 

cagccatagc cattaatgca ggagatgggc ttgaatttgg ttcacctaat 

15 gtcggtatcg gtaattacgt cctctacccg aacttaaacc aagtggatta 



1060 
gcaccaaaca 
cgtggtttgt 

1110 
tgattcaaac 
actaagtttg 



1070 
caaatcccct 
gtttagggga 

1120 
aaggctatgg 
ttccgatacc 



1080 
caaaacaaaa 
gttttgtttt 

1130 
ttcctaaact 
aaggatttga 



1090 
attggccatg 
taaccggtac 

1140 
aggaactggc 
tccttgaccg 



1100 
gcctagaatt 
cggatcttaa 

1150 
cttagttttg 
gaatcaaaac 



25 1160 1170 1180 1190 1200 

acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 
tgtcgtgtcc acggtaatgt catcctttgt ttttattact attcgattga 



1210 1220 1230 1240 1250 

30 ttgtggacca caccagctcc atctcctaac tgtagactaa atgcagagaa 

aacacctggt gtggtcgagg tagaggattg acatctgatt tacgtctctt 



1260 1270 1280 1290 1300 

agatgctaaa ctcactttgg tcttaacaaa atgtggcagt caaatacttg 

35 tctacgattt gagtgaaacc agaattgttt tacaccgtca gtttatgaac 

1310 1320 1330 1340 1350 

ctacagtttc agttttggct gttaaaggca gtttggctcc aatatctgga 
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gatgtcaaag tcaaaaccga caatttccgt caaaccgagg ttatagacct 

1360 1370 1380 1390 1400 

acagttcaaa gtgctcatct tattataaga tttgacgaaa atggagtgct 
tgtcaagttt cacgagtaga ataatattct aaactgcttt tacctcacga 

1410 1420 1430 1440 1450 

actaaacaat tccttcctgg acccagaata ttggaacttt agaaatggag 
tgatttgtta aggaaggacc tgggtcttat aaccttgaaa tctttacctc 

1460 1470 1480 1490 1500 

atcttactga aggcacagcc tatacaaacg gtgttggatt tatgcctaac 
tagaatgact tccgtgtcgg atatgtttgc cacaacctaa atacggattg 

1510 1520 1530 1540 1550 

ctatcagctt atccaaaatc tcacggtaaa actgccaaaa gtaacattgt 
gatagtcgaa taggttttag agtgccattt tgacggtttt cattgtaaca 

1560 1570 1580 1590 1600 

cagtcaagtt tacttaaacg gagacaaaac taaacctgta acactaacca 
gtcagttcaa atgaatttgc ctctgttttg atttggacat tgtgattggt 

1610 1620 1630 1640 1650 

ttacactaaa cggtacacag gaaacaggag acacaactcc aagtgcatac 
aatgtgattt gccatgtgtc ctttgtcctc tgtgttgagg ttcacgtatg 

1660 1670 1680 1690 1700 

tctatgtcat tttcatggga ctggtctggc cacaactaca ttaatgaaat 
agatacagta aaagtaccct gaccagaccg gtgttgatgt aattacttta 

1710 1720 1730 1740 

atttgccaca tcctcttaca ctttttcata cattgcccaa gaataa 
taaacggtgt aggagaatgt gaaaaagtat gtaacgggtt cttatt 
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[SEQ. ID. NO. : 2] - Neck Region Peptide from Human Lung 
Surfactant Protein D. 

PDVAS LRQQVE ALQGQ VQHLQAAFSQ YKKVEL FPNG 

5 

[SEQ. ID. NO.: 3] - Recombinant Fiber Al 

gaattcg atg aag cgc gca aga ccg tct gaa gat acc ttc aac ccc gtg tat 
cca tat gac acg gaa acc ggt cct cca act gtg cct ttt ctt act cct ccc 
0 ttt gta tec ccc aat ggg ttt caa gag agt ccc cct ggg gta etc tct ttg 

cgc eta tec gaa cct eta gtt acc tec aat ggc atg cct gac gta gca age 
tta cga caa cag gta gaa gec ttg caa ggg cag gta caa cac tta cag gcg 
gca ttt age caa tac aaa aag gta gag ttg ttt cca aac gga gee aag aag 
ctg aac gac gec cag gec ccc aag age gac cca teg ate taa etc gag 

5 

[SEQ. ID. NO. : 4] - Human epidermal growth factor (EGF) 

Sequence Range: 1 to 183 

0 10 20 30 40 50 

gggaattcat gaactccgac tccgaatgtc cattgtccca egaeggttae 
cccttaagta cttgaggctg aggcttacag gtaacagggt getgecaatg 
M NSD SEC PLSH DGY> 



60 70 80 90 100 

tgtttgcacg acggtgtttg tatgtacatc gaagctttgg acaagtaege 

acaaacgtgc tgccacaaac atacatgtag cttcgaaacc tgttcatgcg 

CLH DGVC MYI EAL DKYA> 



110 120 130 140 150 

ttgtaactgt gttgttggtt acatcggtga aagatgtcaa tacagagact 

aacattgaca caacaaccaa tgtagecact ttctacagtt atgtctctga 

CNC VVG YIGE RCQ YRD> 
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160 170 180 

tgaagtggtg ggaattgaga tgataagaat tec 
acttcaccac ccttaactct actattctta agg 
LKWW ELR *> 

5 

[SEQ. ID. NO. : 5] - Amino acid linker from Staphylococcal 
protein A 

10 Sequence Range: 1 to 39 

10 20 30 

gecaagaage tgaacgaege ccaggccccc aagagegae 
eggttctteg acttgetgeg ggtccggggg ttctcgctg 
15 A K K LNDA QAP KSD> 

a SPA 48-60 a 



[SEQ. ID. NO. : 6] - Fiber construct Al EGF 

20 Sequence Range: 1 to 513 

10 20 30 40 50 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata 
tacttcgcgc gttctggcag acttctatgg aagttggggc acataggtat 
25 MKR A R P S EDT FNP VYPY> 

b b TAIL b b _____> 



60 70 80 90 100 

tgacacggaa accggtcctc caactgtgcc ttttcttact cctccctttg 
30 actgtgcctt tggecaggag gttgacaegg aaaagaatga ggagggaaac 

DTE TGP PTVP FLT PPF> 

b b TAIL b b > 



110 120 130 140 150 

35 tatcccccaa tgggtttcaa gagagtcccc ctggggtact ctctttgcgc 

atagggggtt acccaaagtt ctctcagggg gaccccatga gagaaacgeg 

L S L R> 
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a_ 

N GFQ ESP PGV> 
b TAIL b > 



5 160 170 180 190 200 

ctatccgaac ctctagttac ctccaatggc atgcctgacg tagcaagctt 
gataggcttg gagatcaatg gaggttaccg tacggactgc atcgttcgaa 
LSE PLVT SNG M> 

a Rl a a > 

10 P D V A S L> 

LUNG SURFAC > 

210 220 230 240 250 

acgacaacag gtagaagcct tgcaagggca ggtacaacac ttacaggcgg 
15 tgctgttgtc catcttcgga acgttcccgt ccatgttgtg aatgtccgcc 

RQQ V E A LQGQ VQH LQA> 
LUNG SURFACTANT PROTEIN D TRIM c > 

260 270 280 290 300 

2 0 catttagcca atacaaaaag gtagagttgt ttccaaacgg agccaagaag 

gtaaatcggt tatgtttttc catctcaaca aaggtttgcc tcggttcttc 
AFSQ YKK VEL FPNG> 
LUNG SURFACTANT PROTEIN D TRIM c_> 

A K K> 

25 > 



>Clal 



310 320 330 | 340 350 

30 ctgaacgacg cccaggcccc caagagcgac ccatcgatca tgaactccga 

gacttgctgc gggtccgggg gttctcgctg ggtagctagt acttgaggct 

M N S D> 
_d > 

LND AQAP KSD> 
35 SPA 48-60__e > 

P S I> 

> 
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360 370 380 390 400 

ctccgaatgt ccattgtccc acgacggtta ctgtttgcac gacggtgttt 
gaggcttaca ggtaacaggg tgctgccaat gacaaacgtg ctgccacaaa 
5 SEC PLS HDGY CLH DGV> 

EPIDERMAL GROWTH FACTOR (UROGASTRONE) ; CODON_ST > 

410 420 430 440 450 

gtatgtacat cgaagctttg gacaagtacg cttgtaactg tgttgttggt 
10 catacatgta gcttcgaaac ctgttcatgc gaacattgac acaacaacca 

CM YI EAL DKY A C N C VVG> 
EPIDERMAL GROWTH FACTOR (UROGASTRONE) ; CODON_ST > 

460 470 480 490 500 

15 tacatcggtg aaagatgtca atacagagac ttgaagtggt gggaattgag 

atgtagccac tttctacagt tatgtctctg aacttcacca cccttaactc 

YIG E-RCQ YRD LKW WELR> 
EPIDERMAL GROWTH FACTOR (UROGASTRONE) ; CODON_ST > 

20 >xhol 

I 

| 510 
atgactcgag ggg 
tactgagctc ccc 
25 X> 

> 

[SEQ. ID. NO. : 7] - Fiber Construct A7 EGF 

3 0 Sequence Range: 1 to 811 

>EcoRl 
I 

| 10 20 30 40 50 

35 gaattcgatg aagcgcgcaa gaccgtctga agataccttc aaccccgtgt 



WO 02/08263 



PCT/GBO 1/03252 



- - 92 - - 



cttaagctac ttcgcgcgtt ctggcagact tctatggaag ttggggcaca 
M> 
> 

M KRA RPSE DTF NPV> 
' 5 J j TAIL j _> 

60 70 80 90 100 

atccatatga cacggaaacc ggtcctccaa ctgtgccttt tcttactcct 
taggtatact gtgcctttgg ccaggaggtt gacacggaaa agaatgagga 
10 YPYD TET GPP TVPF LTP> 

3 j TAIL j j > 

110 120 130 140 150 

ccctttgtat cccccaatgg gtttcaagag agtccccctg gggtactctc 
15 gggaaacata gggggttacc caaagttctc tcagggggac cccatgagag 

L S> 



PFV SPNG FQE SPP GV> 
j TAIL j j > 

20 

160 170 180 190 200 

tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 
aaacgcggat aggcttggag atcaatggag gttaccgtac gaacgcgagt 

L A L> 

25 : 

LRL SEP LVTS NGM> 
h REPEAT 1 h > 



210 220 230 240 250 

3 0 aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa 

tttacccgtt gccggagaga gacctgctcc ggccgttgga atggagggtt 

L T S Q> 
_f > 

KMGN GLS LDE A G N> 
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260 270 280 290 300 

aatgtaacca ctgtgagccc acctctcaaa aaaaccaagt caaacataaa 
ttacattggt gacactcggg tggagagttt ttttggttca gtttgtattt 

I N> 
> 

NVT TVSP P L K KTK S N> - 
f REPEAT 3 f f > 



310 320 330 340 350 

cctggaaata tctgcacccc tcacagttac ctcagaagcc ctaactgtgg 
ggacctttat agacgtgggg agtgtcaatg gagtcttcgg gattgacacc 

L T V> 

> 

LEI SAP LTVT SEA> 
e REPEAT 4 e > 



360 370 380 390 400 

ctgccgccgc acctctaatg gtcgcgggca acacactcac catgcaatca 

gacggcggcg tggagattac cagcgcccgt tgtgtgagtg gtacgttagt 

L T M Q S> 
REPEAT 6 > 

AAAA PLM V A G NT> 
d REPEAT 5 d > 



410 420 430 440 450 

caggccccgc taaccgtgca cgactccaaa cttagcattg ccacccaagg 
gtccggggcg attggcacgt gctgaggttt gaatcgtaac ggtgggttcc 

LSI A T Q G> 
REPEAT 7 > 

QAP LTVH DSK> 
REPEAT 6c > 



460 470 480 490 500 

acccctcaca gtgtcagaag gaaagctagc ccctgacgta gcaagcttac 
tggggagtgt cacagtcttc ctttcgatcg gggactgcat cgttcgaatg 
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30 



L A> 
a_> 

PLT VSE GK> 
REPEAT 7 b > 



D P D V A S L> 

LUNG SURFACTA > 

510 520 530 540 550 
gacaacaggt agaagccttg caagggcagg tacaacactt acaggcggca 
ctgttgtcca tcttcggaac gttcccgtcc atgttgtgaa tgtccgccgt 
10 R Q Q V EAL QGQ VQHL QAA> 
. LUNG SURFACTANT PROTEIN D TRIM k > 

560 570 580 590 600 

tttagccaat acaaaaaggt agagttgttt ccaaacggag ccaagaagct 
15 aaatcggtta tgtttttcca tctcaacaaa ggtttgcctc ggttcttcga 

FSQ YKKV ELF PNG> 
LUNG SURFACTANT PROTEIN D TRIM > 

A K K L> 
_m > 

20 

>Clal 
I 

610 620 630 | 640 650 

gaacgacgcc caggccccca agagcgaccc atcgatcatg aactccgact 
25 cttgctgcgg gtccgggggt tctcgctggg tagctagtac ttgaggctga 

M N S D> 



NDA QAP KSD> 
SPA 48-60__m > 

P S I> 
n > 



35 



660 670 680 690 700 

ccgaatgtcc attgtcccac gacggttact gtttgcacga cggtgtttgt 
ggcttacagg taacagggtg ctgccaatga caaacgtgct gccacaaaca 
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SECP LSH DGY CLHD G V C> 
EPIDERMAL GROWTH FACTOR (UROGASTRONE) ; CODON_ST > 



710 720 730 740 750 

5 atgtacatcg aagctttgga caagtacgct tgtaactgtg ttgttggtta 

tacatgtagc ttcgaaacct gttcatgcga acattgacac aacaaccaat 
MYI EALD KYA CNC VVGY> 

EPIDERMAL GROWTH FACTOR (UROGASTRONE) / CODON_ST > 

760 770 780 790 800 

10 catcggtgaa agatgtcaat acagagactt gaagtggtgg gaattgagat 

gtagccactt tctacagtta tgtctctgaa cttcaccacc cttaactcta 

IGE RCQ YRDL KWW ELR> 
EPIDERMAL GROWTH FACTOR (UROGASTRONE) ; CODON_ST > 



15 >xhol 

I 

f 810 
gactcgaggg g 
ctgagctccc c 
2 0 X> 

_> 

[SEQ, ID. NO. : 8] - G250 Construct 

25 Sequence Range: 1 to 1056 

10 20 30 40 50 

gacattgtga tgacccagtc tcaaagattc atgtccacaa cagtaggaga 
ctgtaacact actgggtcag agtttctaag tacaggtgtt gtcatcctct 
30 DIV MTQS QRF MST TVGD> 

C c VK c C > 



35 



60 70 80 90 100 

cagggtcagc atcacctgca aggccagtca gaatgtggtt tctgctgttg 
gtcccagtcg tagtggacgt tccggtcagt cttacaccaa agacgacaac 
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RVS ITC KASQ NVV SAV> 
c c VK c c > 



110 120 130 140 150 
5 cctggtatca acagaaacca ggacaatctc ctaaactact gatttactca 
ggaccatagt tgtctttggt cctgttagag gatttgatga ctaaatgagt 
A W Y Q QKP GQS PKLL IYS> 
c c VK c c > 

10 160 170 180 190 200 

gcatccaatc ggtacactgg agtccctgat cgcttcacag gcagtggatc 
cgtaggttag ccatgtgacc tcagggacta gcgaagtgtc cgtcacctag 

A S N RYTG VPD RFT G S G S> 
c c VK c c > 

15 

210 220 230 240 250 

tgggacagat ttcactcfcca ccattagcaa tatgcagtct gaagacctgg 
accctgtcta aagtgagagt ggtaatcgtt atacgtcaga cttctggacc 
GTD FTL TISN HQS EDL> 

2 0 c c VK c c > 

260 270 280 290 300 

ctgatttttt ctgtcaacaa tatagcaact atccgtggac gttcggtgga 
gactaaaaaa gacagttgtt atatcgttga taggcacctg caagccacct 
25 ADFF CQQ YSN YPWT FGG> 

c c VK c c > 

310 320 330 340 350 

ggcaccaagc tggaaatcaa aggatctggc tctacttccg gtagcggcaa 

3 0 ccgtggttcg acctttaqtt tcctagaccg agatgaaggc catcgccgtt 

G T K L E I K> 
VK c__> 

GSG STS GSGK> 
d LINK d > 

35 
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360 370 380 390 400 

atcctctgaa ggcaaaggta ctagagacgt gaagctcgtg gagtctgggg 
taggagactt ccgtttccat gatctctgca cttcgagcac ctcagacccc 

DV KLV ESG> 
a VH a > 

SSE GKG TR> 
LINK d > 



410 420 430 440 450 

gaggcttagt gaagcttgga gggtccctga aactctcctg tgcagcctct 
ctccgaatca cttcgaacct cccagggact ttgagaggac acgtcggaga 
GGLV KLG GSL KLSC A A S> 
a a VH a a > 



460 47.0 480 490 500 

ggattcactt tcagtaacta ttacatgtct tgggttcgcc agactccaga 
cctaagtgaa agtcattgat aatgtacaga acccaagcgg tctgaggtct 

GFT FSNY YMS WVR QTPE> 
a a VH a a > 



510 520 530 540 550 

gaagaggctg gagttggtcg cagccattaa tagtgatggt ggtatcacct 
cttctccgac ctcaaccagc gtcggtaatt atcactacca ccatagtgga 

KRL ELV A A I N SDG GIT> 
a a VH a a > 

560 570 580 590 600 

actatctaga cactgtgaag ggccgattca ccatttcaag agacaatgcc 
tgatagatct gtgacacttc ccggctaagt ggtaaagttc tctgttacgg 
YYLD TVK GRF TISR DNA> 
a a VH a a > 



610 620 630 640 650 

aagaacaccc tgtacctgca aatgagcagt ctgaagtctg aggacacagc 
ttcttgtggg acatggacgt ttactcgtca gacttcagac tcctgtgtcg 
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KNT LYLQ MSS LKS EDTA> 
a a VH a a > 



. 660 670 680 690 700 
cttgttttac tgtgcaagac accgctcggg ctacttttct atggactact 
gaacaaaatg acacgttctg tggcgagccc gatgaaaaga tacctgatga 
LFY CAR HRSG Y F S MDY> 
a a VH a a > 



710 720 730 740 750 
ggggtcaagg aacctcagtc accgtctcct catgcccacc gtgcccagca 
ccccagttcc ttggagtcag tggcagagga gtacgggtgg cacgggtcgt 
WGQG TSV TVS S> 
_a VH a a > 

A> 
> 

C P P C P> 
HINGE > 



20 760 . 770- 780 790 800 

cctgaactcc tagggggacc gtcagtcttc ctcttccccc caaaacccaa 
ggacttgagg atccccctgg cagtcagaag gagaaggggg gttttgggtt 

PEL LGGP S V F LFP PKPK> 
b b CH2 b b > 

25 

810 820 830 840 850 

ggacaccctc atgatctccc ggacccctga ggtcacatgc gtggtggtgg 
cctgtgggag tactagaggg cctggggact ccagtgtacg caccaccacc 
DTL MIS RTPE VTC VVV> 
3 0 b b CH2 b b > 



860 870 880 890 900 

acgtgagcca cgaagaccct gaggtcaagt tcaactggta cgtggacggc 

tgcactcggt gcttctggga ctccagttca agttgaccat gcacctgccg 

35 DVSH EDP EVK FNWY VDG> 



10 
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b b CH2 b b > 



910 920 930 940 950 

gtggaggtgc ataatgccaa gacaaagccg cgggaggagc agtacaacag 
5 cacctccacg tattacggtt ctgtttcggc gccctcctcg tcatgttgtc 

VEV H IS! A K TKP REE QYNS> 
b b CH2 b b > 



960 970 980 990 1000 

10 cacgtaccgg gtggtcagcg tcctcaccgt cctgcaccag gactggctga 

gtgcatggcc caccagtcgc aggagtggca ggacgtggtc ctgaccgact 

TYR V V S VLTV LHQ DWL> 
b b CH2 b b > 



15 1010 1020 1030 1040 1050 

atggcaagga gtacaagtgc aaggtctcca acaaagccct cccagccccc 
taccgttcct catgttcacg ttccagaggt tgtttcggga gggtcggggg 
NGKE YKC KVS NKAL PAP> 
b b CH2 b b > 

20 

atcgag 
tagctc 
I E> 



25 

[SEQ. ID. NO. : 9] - C242 Construct 

30 Sequence Range: 1 to 7 93 

>Sfi >EcoRl >Clal 

I I I 

| 10 |20 | 30 40 50 

35 gcggcccagc cggccacgaa ttcatcgatg gatattgtga tgactcaggc 



PCT/GB01/03252 
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cgccgggtcg gccggtgctt aagtagctac ctataacact actgagtccg 
A Q P A T N SSM> 
b_BACKPRIMER > 

D I V M T Q A> 
VK PHARMACIA > 

A> 

60 70 80 90 100 

tgcaccctct gtacctgtca ctcctggaga gtcagtatcc atctcctgca 
acgtgggaga catggacagt gaggacctct cagtcatagg tagaggacgt 
APS VPV TPGE-. SVS ISO 

d VK PHARMACIA d > 



110 120 130 140 150 

ggtctagtaa gagtctcctg catagtaatg gcaacactta cttgtattgg 
ccagatcatt ctcagaggac gtatcattac cgttgtgaat gaacataacc 
RSSK SLL HSN GNTY LYW> 

d VK PHARMACIA d > 

SLL HSN GNTY LY> 
f CDR1 f > 



160 170 180 190 200 

ttcctgcaga ggccaggcca gtctcctcag ctcctgatat atcggatgtc 
aaggacgtct ccggtccggt cagaggagtc gaggactata tagcctacag 
FLQ RPGQ SPQ LLI YRMS> 

d VK PHARMACIA d > 

R M S> 



210 220 230 240 250 

caaccttgtc tcaggagtcc cagacaggtt cagtggcagt gggtcaggaa 
gttggaacag agtcctcagg gtctgtccaa gtcaccgtca cccagtcctt 
NLV SGV PDRF SGS GSG> 

d VK PHARMACIA d _> 
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N L V S> 
e > 



260 270 280 290 300 
5 ctgctttcac actgagaatc agtagagtgg aggctgagga tgtgggtgtt 
gacgaaagtg tgactcttag tcatctcacc tccgactcct acacccacaa 
T A F T LRI SRV E A E D V G V> 
d VK PHARMACIA d > 



10 310 320 330 340 350 

tattactgtc tgcaacatct agagtatccg ttcacgttcg gtcctgggac 
ataatgacag acgttgtaga tctcataggc aagtgcaagc caggaccctg 
LQHL EYP F T> 

_a C DR3_ a > 

15 YYC LQHL EYP FTF GPG T> 

d VK PHARMACIA d > 



360 370 380 390 400 

caagctggag ctgaaacggc ccccggactt tgttcccccg gccgctagtt 
20 gttcgacctc gactttgccg ggggcctgaa acaagggggc cggcgatcaa 

K L E L K R> 
VK PHARMACIA > 

PPDF VPP AAS> 
_1 1 8 2 5 1 > 

25 

410 420 430 440 450 

tccctgatca ctcccctcgt ggccaggtcc agttggtgca gtctggacct 
agggactagt gaggggagca ccggtccagg tcaaccacgt cagacctgga 

QV QLVQ SGP> 

3 0 VH PHARMACIA [SPLIT] > 

FPDH SPR G> 
825 1 > 



35 



460 470 480 490 500 

gagctgaaga agcctggaga gacagtcaag atctcctgca aggcttctga 
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ctcgacttct tcggacctct ctgtcagttc tagaggacgt tccgaagact 

ELK KPGE T V K ISC K A S D> 
h VH PHARMACIA [SPLIT] h > 



510 520 530 540 550 

ttataccttc acatactatg gaatgaactg ggtgaagcag gctccgggaa 
aatatggaag tgtatgatac cttacttgac ccacttcgtc cgaggccctt 
YTF TYY GMNW VKQ APG> 

h VH PHARMACIA [SPLIT J h > 

Y Y G M N> 
CDR1 > 



560 570 580 590 600 

agggtttaaa gtggatgggc tggatagaca ccaccactgg agagccaaca 
tcccaaattt cacctacccg acctatctgt ggtggtgacc tctcggttgt 
KGLK WMG W I D TTTG EPT> 

h VH PHARMACIA [SPLIT] h > 

WID TTTG EPT> 
CDR2 [SPLIT] > 



610 620 630 640 650 

tatgctgaag attttaaggg acggattgcc ttctctttgg agacctctgc 
atacgacttc taaaattccc tgcctaacgg aagagaaacc tctggagacg 

YAE DFKG RIA FSL ETSA> 
h VH PHARMACIA [SPLIT] h > 

YAE D F K G> 
CDR2 [SPLIT] i_> 



660 670 680 690 700 

cagcactgcc tatttgcaga tcaaaaacct caaaaatgag gacacggcta 
gtcgtgacgg ataaacgtct agtttttgga gtttttactc ctgtgccgat 

STA YLQ IKNL KNE DTA> 
h VH PHARMACIA [SPLIT] h > 



710 720 730 740 750 
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catatttctg tgcaagacgg gggccttaca actggtactt tgatgtctgg 
gtataaagac acgttctgcc cccggaatgt tgaccatgaa actacagacc 

R GPY NWYF DV> 

g C DR3 g > 

5 TYFC ARR GPY NWYF DV W> 
h VH PHARMACIA [SPLIT] h > 

>Xho (-) StopXho_adapt_ [Split] 
I 

10 760 770 780 [ 790 

ggccaaggga ccacggtcac cgtctcctca ctcgattaac teg 
ccggttccct ggtgccagtg gcagaggagt gagctaattg age 

L X> 
> 

15 GQG TTVT VSS> 

VH PHARMACIA [SPLIT] > 
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[SEQ. ID. NO. : 10] - Fiber Construct Al G250 



Sequence Range: 1 to 1411 



>EcoRl 



| 10 20 30 40 50 

gaattcgatg aagcgcgcaa gaccgtctga agataccttc aaccccgtgt 
10 cttaagctac ttcgcgcgtt ctggcagact tctatggaag ttggggcaca 

M> 
> 

M KRA RPSE DTF N P V> 
f f T A I L_ f > 

15 

60 70 80 90 100 
atccatatga cacggaaacc ggtcctccaa ctgtgccttt tcttactcct 
taggtatact gtgcctttgg ccaggaggtt gacacggaaa agaatgagga 
YPYD TET GPP TVPF LTP> 
20 f f TAIL f f > 

110 120 130 140 150 

ccctttgtat cccccaatgg gtttcaagag agtccccctg gggtactctc 
gggaaacata gggggttacc caaagttctc tcagggggac cccatgagag 
25 L S> 

> 

PFV SPNG FQE SPP GV> 
f TAIL f f > 



30 160 170 18*0 190 200 

tttgcgccta tccgaacctc tagttacctc caatggcatg cctgacgtag 
aaacgcggat aggcttggag atcaatggag gttaccgtac ggactgcatc 

LRL SEP LVTS NGM> 
d REPEAT 1 d > 

35 P D V> 
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210 220 230 240 250 
caagcttacg acaacaggta gaagccttgc aagggcaggt acaacactta 
5 gttcgaatgc tgttgtccat cttcggaacg ttcccgtcca tgttgtgaat 
ASLR QQV E A L QGQV QHL> 
LUNG SURFACTANT PROTEIN D TRIM g > 

260 270 280 290 300 

10 caggcggcat ttagccaata caaaaaggta gagttgtttc caaacggagc 

gtccgccgta aatcggttat gtttttccat ctcaacaaag gtttgcctcg 

A> 
> 

Q A A FSQY KKV ELF PNG> 
15 LUNG SURFACTANT PROTEIN D TRIM g > 



>Clal 



310 320 330 340 350 

20 caagaagctg aacgacgccc aggcccccaa gagcgaccca tcgatcgaca 

gttcttcgac ttgctgcggg tccgggggtt ctcgctgggt agctagctgt 

KKL NDA QAPK SD> 
b SPA 4 8-60 b > 

P S I> 

25 c > 

D> 
> 

360 370 380 390 400 

3 0 ttgtgatgac ccagtctcaa agattcatgt ccacaacagt aggagacagg 

aacactactg ggtcagagtt tctaagtaca ggtgttgtca tcctctgtcc 
IVMT QSQ RFM STTV GDR> 
j j VK j j > 

35 410 420 430 440 450 
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gtcagcatca cctgcaaggc cagtcagaat gtggtttctg ctgttgcctg 
cagtcgtagt ggacgttccg gtcagtctta caccaaagac gacaacggac 
V S I TCKA SQN VVS AVAW> 

j j VK j j > 

5 460 470 480 490 500 

gtatcaacag aaaccaggac aatctcctaa actactgatt tactcagcat 
catagttgtc tttggtcctg ttagaggatt tgatgactaa atgagtcgta 

YQQ KPG QSPK LLI YSA> 
j j VK j j > 

10 

510 520 530 540 550 
ccaatcggta cactggagtc cctgatcgct tcacaggcag tggatctggg 
ggttagccat gtgacctcag ggactagcga agtgtccgtc acctagaccc 
SNRY TGV PDR F T G S GSG> 
1 5 j j VK j j > 

560 570 580 590 600 

acagatttca ctctcaccat tagcaatatg cagtctgaag acctggctga 
tgtctaaagt gagagtggta atcgttatac gtcagacttc tggaccgact 
20 TDF TLTI SNM QSE DLAD> 

j j VK j j > 

610 620 630 640 650 

ttttttctgt caacaatata gcaactatcc gtggacgttc ggtggaggca 
25 aaaaaagaca gttgttatat cgttgatagg cacctgcaag ccacctccgt 

F F C QQY SNYP WTF GGG> 
j j VK j j > 

660 670 680 690 700 

30 ccaagctgga aatcaaagga tctggctcta cttccggtag cggcaaatcc 

ggttcgacct ttagtttcct agaccgagat gaaggccatc gccgtttagg 
T K L E I K> 
VKj > 

G SGS TSGS GKS> 
35 k LINK k > 
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710 720 730 740 750 

tctgaaggca aaggtactag agacgtgaag ctcgtggagt ctgggggagg 
agacttccgt ttccatgatc tctgcacttc gagcacctca gaccccctcc 

DVK LVE SGGG> 
h VH h > 

S E G K G T R> 
LINK k > 



760 770 780 790 800 

cttagtgaag cttggagggt ccctgaaact ctcctgtgca gcctctggat 
gaatcacttc gaacctccca gggactttga gaggacacgt cggagaccta 

LVK LGG SLKL S C A A S G> 
h h VH h h > 



810 820 830 840 850 
tcactttcag taactattac atgtcttggg ttcgccagac tccagagaag 
agtgaaagtc attgataatg tacagaaccc aagcggtctg aggtctcttc 
FTFS NYY M S W VRQT PEK> 
h h VH h h > 



860 870 880 890 900 

aggctggagt tggtcgcagc cattaatagt gatggtggta tcacctacta 
tccgacctca accagcgtcg gtaattatca ctaccaccat agtggatgat 

RLE L V A A INS DGG I T Y Y> 
h h VH h h > 



910 920 930 940 950 

tctagacact gtgaagggcc gattcaccat ttcaagagac aatgccaaga 
agatctgtga cacttcccgg ctaagtggta aagttctctg ttacggttct 
30 LDT VKG RFTI SRD NAK> 

h h VH h h > 



960 970 980 990 1000 

acaccctgta cctgcaaatg agcagtctga agtctgagga cacagccttg 
tgtgggacat ggacgtttac tcgtcagact tcagactcct gtgtcggaac 



15 



20 
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NTLY LQM SSL KSED T A L> 
h h VH h h > 



1010 1020 1030 1040 1050 

ttttactgtg caagacaccg ctcgggctac ttttctatgg actactgggg 
aaaatgacac gttctgtggc gagcccgatg aaaagatacc tgatgacccc 
F Y C A R H R S G Y F S M D Y W G> 
h h VH h h > 



1060 1070 1080 1090 1100 

tcaaggaacc tcagtcaccg tctcctcatg cccaccgtgc ccagcacctg 
agttccttgg agtcagtggc agaggagtac gggtggcacg ggtcgtggac 

QGT SVT VSS> 
h VH h > 

A P> 
> 

C P P C P> 
1 HINGE 1 > 

1110 1120 1130 1140 1150 

aactcctagg gggaccgtca gtcttcctct tccccccaaa acccaaggac 
ttgaggatcc ccctggcagt cagaaggaga aggggggttt tgggttcctg 
ELLG GPS V F L FPPK PKD> 
i i CH2 i i > 



1160 1170 1180 1190 1200 

accctcatga tctcccggac ccctgaggtc acatgcgtgg tggtggacgt 
tgggagtact agagggcctg gggactccag tgtacgcacc accacctgca 
TLM ISRT PEV TCV VVDV> 
i i CH2 i i > 



1210 1220 1230 1240 1250 

gagccacgaa gaccctgagg tcaagttcaa ctggtacgtg gacggcgtgg 

ctcggtgctt ctgggactcc agttcaagtt gaccatgcac ctgccgcacc 

SHE DPE VKFN WYV DGV> 
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CH2 



1260 1270 1280 1290 1300 
aggtgcataa tgccaagaca aagccgcggg aggagcagta caacagcacg 
5 tccacgtatt acggttctgt ttcggcgccc tcctcgtcat gttgtcgtgc 
EVHN AKT KPR EEQY NST> 
i i CH2 i i > 

/ 1310 1320 1330 1340 1350 

10 taccgggtgg tcagcgtcct caccgtcctg caccaggact ggctgaatgg 

atggcccacc agtcgcagga gtggcaggac gtggtcctga ccgacttacc 

YRV VSVL TVL HQD WLNG> 
i i CH2 i__ i > 

15 1360 1370 1380 1390 1400 

caaggagtac aagtgcaagg tctccaacaa agccctccca gcccccatcg 
gttcctcatg ttcacgttcc agaggttgtt tcgggagggt cgggggtagc 

KEY KCK VSNK ALP API> 
i i CH2 i i > 

20 

>Xhol 
I 

11410 
agtaactcga g 
25 tcattgagct c 

* L E> 

a > 



30 



E> 



[SEQ. ID. NO.: 11] - Fiber Construct A7 G250 



35 Sequence Range: 1 to 1702 
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>EcoRl 
1 

| 10 20 30 40 50 

gaattcgatg aagcgcgcaa gaccgtctga agataccttc aaccccgtgt 
cttaagctac ttcgcgcgtt ctggcagact tctatggaag ttggggcaca 
M> 
_> 

M KRA RPSE DTF NPV> 
m m TAIL m > 



60 70 80 90 100 
atccatatga cacggaaacc ggtcctccaa ctgtgccttt tcttactcct 
taggtatact gtgcctttgg ccaggaggtt gacacggaaa agaatgagga 
YPYD TET GPP TVPF LTP> 
m m T A I L m m > 



110 120 130 140 150 

ccctttgtat cccccaatgg gtttcaagag agtccccctg gggtactctc 
gggaaacata gggggttacc caaagttctc tcagggggac cccatgagag 
20 L S> 

> 

P F V SPNG FQE SPP GV> 
m TAIL m m > 



'25 160 170 180 190 200 

tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 
aaacgcggat aggcttggag atcaatggag gttaccgtac gaacgcgagt 

L A L> 
> 

30 LRL SEP LVTS NGM> 

k REPEAT 1 k > 



10 



35 



210 220 230 240 250 

aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa 
tttacccgtt gccggagaga gacctgctcc ggccgttgga atggagggtt 
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L T S Q> 
i > 

KMGN GLS LDE A G N> 
j REPEAT 2 j > 

5 

260 270 280 290 300 

aatgtaacca ctgtgagccc acctctcaaa aaaaccaagt caaacataaa 
ttacattggt gacactcggg tggagagttt ttttggttca gtttgtattt 

I N> 

10 > 

NVT TVSP PLK KTK SN> 
i REPEAT 3 i i > 



310 320 330 340 350 

cctggaaata tctgcacccc tcacagttac ctcagaagcc ctaactgtgg 
15 ggacctttat agacgtgggg agtgtcaatg gagtcttcgg gattgacacc 

L T V> 
> 

LEI SAP LTVT SEA> 
h REPEAT 4 h > 

20 

360 370 380 390 400 

ctgccgccgc acctctaatg gtcgcgggca acacactcac catgcaatca 
gacggcggcg tggagattac cagcgcccgt tgtgtgagtg gtacgttagt 

L T M Q S> 

25 REPEAT 6 > 

AAAA PLM V A G NT> 
g REPEAT 5 g > 

410 420 430 440 450 

30 caggccccgc taaccgtgca cgactccaaa cttagcattg ccacccaagg 

gtccggggcg attggcacgt gctgaggttt gaatcgtaac ggtgggttcc 

LSI A T Q G> 
REPEAT 7 > 

QAP LTVH DSK> 
35 REPEAT 6 f > 
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460 470 480 490 500 

acccctcaca gtgtcagaag gaaagctagc ccctgacgta gcaagcttac 
tggggagtgt cacagtcttc ctttcgatcg gggactgcat cgttcgaatg 

L A> 

5 ci_> 

P L T V S E G K> 
REPEAT 7 e > 

P D V A S L> 
LUNG SURFACTA > 

10 

510 520 530 540 550 
gacaacaggt agaagccttg caagggcagg tacaacactt acaggcggca 
ctgttgtcca tcttcggaac gttcccgtcc atgttgtgaa tgtccgccgt 
RQQV EAL QGQ V Q H L QAA> 
15 LUNG SURFACTANT PROTEIN D TRIM n > 

560 570 580 590 600 

tttagccaat acaaaaaggt agagttgttt ccaaacggag ccaagaagct 
aaatcggtta tgtttttcca tctcaacaaa ggtttgcctc ggttcttcga 
20 A K K L> 

_b > 

F S Q Y K K V ELF P N G> 
LUNG SURFACTANT PROTEIN D TRIM > 

25 >Clal 

I 

610 620 630 | 640 650 

gaacgacgcc caggccccca agagcgaccc atcgatcgac attgtgatga 
cttgctgcgg gtccgggggt tctcgctggg tagctagctg taacactact 
30 NDA QAP KSD> 

SPA 4 8-60 b > 

P S I> 
c > 

D I V M> 
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660 670 680 690 700 
cccagtctca aagattcatg tccacaacag taggagacag ggtcagcatc 
gggtcagagt ttctaagtac aggtgttgtc atcctctgtc ccagtcgtag 
TQSQ RFM STT VGDR VSI> 
5 q q VK q q > 

710 720 730 740 750 

acctgcaagg ccagtcagaa tgtggtttct gctgttgcct ggtatcaaca 
tggaegttcc ggtcagtctt acaccaaaga cgacaacgga ccatagttgt 
10 TCK ASQN V V S A V A WYQQ> 

q q VK q q > 

760 770 780 790 800 

gaaaccagga caatctccta aactactgat ttactcagca tccaatcggt 
15 ctttggtcct gttagaggat ttgatgacta aatgagtcgt aggttagcca 

KPG QSP KLLI YSA S N R> 
q q VK q q > 

810 820 830 840 850 

2 0 acactggagt ccctgatcgc ttcacaggca gtggatctgg gacagatttc 

tgtgacctca gggactagcg aagtgtccgt cacctagacc ctgtctaaag 
YTGV PDR FTG SGSG TDF> 
q q VK q q > 

25 860 870 880 890 900 

actctcacca ttagcaatat gcagtctgaa gacctggctg attttttctg 
tgagagtggt aatcgttata cgtcagactt ctggaccgac taaaaaagac 

TLT ISNM QSE D L A D F F C> 
q q VK q q > 

30 

910 920 930 940 950 

tcaacaatat agcaactatc cgtggacgtt cggtggaggc accaagctgg 
agttgttata tcgttqatag gcacctgcaa gccacctccg tggttcgacc 
QQY SNY PWTF GGG T K L> 
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960 970 980 990 1000 

aaatcaaagg atctggctct acttccggta gcggcaaatc ctctgaaggc 

tttagtttcc tagaccgaga tgaaggccat cgccgtttag gagacttccg 
E I K> 

5 > 

G SGS TSG SGKS SEG> 

r r LINK r > 



1010 1020 1030 1040 1050 
aaaggtacta gagacgtgaa gctcgtggag tctgggggag gcttagtgaa 
tttccatgat ctctgcactt cgagcacctc agaccccctc cgaatcactt 
DVK L V E SGG G L V K> 
o VH o > 

K G T R> 

r > 

1060 1070 1080 1090 1100 

gcttggaggg tccctgaaac tctcctgtgc agcctctgga ttcactttca 
cgaacctccc agggactttg agaggacacg tcggagacct aagtgaaagt 
LGG SLK LSCA ASG FTF> 

o o VH o o > 



1110 1120 1130 1140 1150 

gtaactatta catgtcttgg gttcgccaga ctccagagaa gaggctggag 
cattgataat gtacagaacc caagcggtct gaggtctctt ctccgacctc 
25 SNYY MSW VRQ TPEK RLE> 

o o VH o o > 



1160 1170 1180 1190 1200 

ttggtcgcag ccattaatag tgatggtggt atcacctact atctagacac 
30 aaccagcgtc ggtaattatc actaccacca tagtggatga tagatctgtg 

L V A AINS DGG ITY YLDT> 
o o VH o o > 



10 



15 



20 



35 



1210 1220 1230 1240 1250 

tgtgaagggc cgattcacca tttcaagaga caatgccaag aacaccctgt 
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acacttcccg gctaagtggt aaagttctct gttacggttc ttgtgggaca 
VKG RFT ISRD NAK N T L> 

o o VH o o > 



5 1260 1270 1280 1290 1300 
acctgcaaat gagcagtctg aagtctgagg acacagcctt gttttactgt 
tggacgttta ctcgtcagac ttcagactcc tgtgtcggaa caaaatgaca 
Y L Q M SSL KSE DTAL F Y C> 
: o o VH o o > 

10 

1310 1320 1330 1340 1350 

gcaagacacc gctcgggcta cttttctatg gactactggg gtcaaggaac 
cgttctgtgg cgagcccgat gaaaagatac ctgatgaccc cagttccttg 
ARH RSGY F S M D Y W GQGT> 
15 o oVHo o > 



1360 1370 1380 1390 1400 

ctcagtcacc gtctcctcat gcccaccgtg cccagcacct gaactcctag 
gagtcagtgg cagaggagta cgggtggcac gggtcgtgga cttgaggatc 
20 S V T V S S> 

VH > 

A P E L L> 
CH2 > 

C P P C P> 
25 _s HINGE s > 

1410 1420 1430 1440 1450 

ggggaccgtc agtcttcctc ttccccccaa aacccaagga caccctcatg 
cccctggcag tcagaaggag aaggggggtt ttgggttcct gtgggagtac 
30 GGPS VFL FPP KPKD TLM> 

p p C H 2 p p > 



35 



1460 1470 1480 1490 1500 

atctcccgga cccctgaggt cacatgcgtg gtggtggacg tgagccacga 
tagagggcct ggggactcca gtgtacgcac caccacctgc actcggtgct 
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I S R TPEV TCV VVD VSHE> 
p p CH2 p p > 



1510 1520 1530 1540 1550 

agaccctgag gtcaagttca actggtacgt ggacggcgtg gaggtgcata 
tctgggactc cagttcaagt tgaccatgca cctgccgcac ctccacgtat 

DPE VKF NWYV DGV EVH> 
p p C H 2 p p > 



1560 1570 1580 1590 1600 
atgccaagac aaagccgcgg gaggagcagt acaacagcac gtaccgggtg 
tacggttctg tttcggcgcc ctcctcgtca tgttgtcgtg catggcccac 
N A K T KPR EEQ YNST YRV> 
p p CH 2 p p ■ > 



1610 1620 1630 1640 1650 

gtcagcgtcc tcaccgtcct gcaccaggac tggctgaatg gcaaggagta 
cagtcgcagg agtggcagga cgtggtcctg accgacttac cgttcctcat 

V S V LTVL HQD WLN GKEY> 
p p CH 2 p p > 



>Xhol 
I 

1660 1670 1680 1690 1700 

caagtgcaag gtctccaaca aagccctccc agcccccatc gagtaactcg 
gttcacgttc cagaggttgt ttcgggaggg tcgggggtag ctcattgagc 

* L> 



KCK VSN K A L P API E> 
p CH2 p p > 



ag 
tc 
E> 
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[SEQ . ID. NO. : 12] - Fiber Construct Al C242 



Sequence Range: 1 to 1111 



>EcoRl 



I - 10 20 30 40 50 

10 gaattcgatg aagcgcgcaa gaccgtctga agataccttc aaccccgtgt 

cttaagctac ttcgcgcgtt ctggcagact tctatggaag ttggggcaca 
M> 
> 

M KRA RPSE DTF N P V> 

1 5 o o TA I L o > 

60 70 80 90 100 

atccatatga cacggaaacc ggtcctccaa ctgtgccttt tcttactcct 

taggtatact gtgcctttgg ccaggaggtt gacacggaaa agaatgagga 

2 0 YPYD T E T GPP TVPF LTP> 

o o TAIL o o > 



110 120 130 140 150 

ccctttgtat cccccaatgg gtttcaagag agtccccctg gggtactctc 

2 5 gggaaacata gggggttacc caaagttctc tcagggggac cccatgagag 

L S> 

> 

P E V SPNG FQE SPP GV> 
o TAIL o o > 

30 

160 170 180 190 200 

tttgcgccta tccgaacctc tagttacctc caatggcatg cctgacgtag 
aaacgcggat aggcttggag atcaatggag gttaccgtac ggactgcatc 
LRL SEP LVTS NGM> 

3 5 m REPEAT 1 m > 
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P D V> 
> 

210 220 230 240 250 
caagcttacg acaacaggta gaagccttgc aagggcaggt acaacactta 
gttcgaatgc tgttgtccat cttcggaacg ttcccgtcca tgttgtgaat 
ASLR QQV EAL QGQV QHL> 
LUNG SURFACTANT PROTEIN D TRIM p > 

260 270 280 290 300 

caggcggcat ttagccaata caaaaaggta gagttgtttc caaacggagc 
gtccgccgta aatcggttat gtttttccat ctcaacaaag gtttgcctcg 

A> 
> 

Q A A F S Q Y KKV ELF PNG> 
LUNG SURFACTANT PROTEIN D TRIM p > 

>Clal 
I 

310 ' ' 320 330 340 350 

caagaagctg aacgacgccc aggcccccaa gagcgaccca tcgatcgata 
gttcttcgac ttgctgcggg tccgggggtt ctcgctgggt agctagctat 

KKL NDA QAPK SD> 
b SPA 4 8-60 b > 

P S I> 
c > 

D> 
> 

360 370 380 390 400 

ttgtgatgac tcaggctgca ccctctgtac ctgtcactcc tggagagtca 
aacactactg agtccgacgt gggagacatg gacagtgagg acctctcagt 
IVMT Q A A PSV PVTP GES> 
e VK PHARMACIA e > 
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410 420 430 440 450 

gtatccatct cctgcaggtc tagtaagagt ctcctgcata gtaatggcaa 
cataggtaga ggacgtccag atcattctca gaggacgtat cattaccgtt 
VSI SCRS SKS LLH SNGN> 

e VK PHARMACIA e > 

S LLH SNGN> 
g CDR1 > 

' 460 470 480 490 500 

cacttacttg tattggttcc tgcagaggcc aggccagtct cctcagctcc 
gtgaatgaac ataaccaagg acgtctccgg tccggtcaga ggagtcgagg 

TYL Y W F LQRP GQS PQL> 
e VK PHARMACIA e 

TYL Y> 

g > 



510 520 530 540 550 

tgatatatcg gatgtccaac cttgtctcag gagtcccaga caggttcagt 
actatatagc ctacaggttg gaacagagtc ctcagggtct gtccaagtca 
LIYR MSN LVS GVPD RFS> 

e VK PHARMACIA e > 

R MSN L V S> 
f CDR2 > 

560 570 580 590 600 

ggcagtgggt caggaactgc tttcacactg agaatcagta gagtggaggc 
ccgtcaccca gtccttgacg aaagtgtgac tcttagtcat ctcacctccg 
GSG SGTA FTL RIS R V E A> 
e VK PHARMACIA e _____ > 



610 620 630 640 650 

tgaggatgtg ggtgtttatt actgtctgca acatctagag tatccgttca 
actcctacac ccacaaataa tgacagacgt tgtagatctc ataggcaagt 

LQ HLE YPF> 
" d CDR3 d > 
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EDV GVY YCLQ HLE YPF> 

e VK PHARMACIA e > 



660 670 680 690 700 

5 cgttcggtcc tgggaccaag etggagctga aacggccccc ggactttgtt 

gcaagccagg accctggttc gacctcgact ttgccggggg cctgaaacaa 
T> 
_> 

TFGP GTK LEL KR> 
10 e_VK PHARMACIA e > 

P P D F V> 
825 > 



710 720 730 740 750 

15 cccccggccg ctagtttccc tgatcactcc cctcgtggcc aggtccagtt 

gggggccggc gatcaaaggg actagtgagg ggagcaccgg tccaggtcaa 

Q V Q L> 
_i > 

PPA ASFP DHS PRO 
20 1 825 1 > • 



760 770 780 790 800 

ggtgcagtct ggacctgagc tgaagaagcc tggagagaca gtcaagatct 
ccacgtcaga cctggactcg acttcttcgg acctctctgt cagttctaga 
25 VQS GPE LKKP GET VKI> 

i VH PHARMACIA [SPLIT] i > 

810 820 830 840 850 

cctgcaaggc ttctgattat accttcacat actatggaat gaactgggtg 
ggacgttccg aagactaata tggaagtgta tgatacctta cttgacccac 
30 SCKA S D Y TFT YYGM NWV> 

i VH PHARMACIA [SPLIT] i > 

YYGM N> 
k CDR1 k > 



35 



860 



870 



880 890 900 
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aagcaggctc cgggaaaggg tttaaagtgg atgggctgga tagacaccac 
ttcgtccgag gccctttccc aaatttcacc tacccgacct atctgtggtg 
K Q A PGKG LKW M G W I D T T> 

i VH PHARMACIA [SPLIT] i > 

5 W I D T T> 

CDR2 [SP > 

910 920 930 940 950 

cactg / gagag ccaacatatg ctgaagattt taagggacgg attgccttct 
10 gtgacctctc ggttgtatac gacttctaaa attccctgcc taacggaaga 

TGE PTY AEDF KGR IAF> 

i VH PHARMACIA [SPLIT] i > 

TGE PTY AEDF KG> 
j CDR2 [SPLIT] j > 



15 

960 970 980 990 1000 
ctttggagac ctctgccagc actgcctatt tgcagatcaa aaacctcaaa 
gaaacctctg gagacggtcg tgacggataa acgtctagtt tttggagttt 
S-LET SAS TAY LQIK NLK> 
2 0 i VH PHARMACIA [SPLIT] i > 

1010 1020 1030 1040 1050 

aatgaggaca cggctacata tttctgtgca agacgggggc cttacaactg 
ttactcctgt gccgatgtat aaagacacgt tctgcccccg gaatgttgac 
25 R G P Y N W> 

CDR3 > 

NED TATY F C A RRG PYNW> 
i VH PHARMACIA [SPLIT] i > 



1060 1070 1080 1090 1100 

3 0 gtactttgat gtctggggcc aagggaccac ggtcaccgtc tcctcactcg 

catgaaacta cagaccccgg ttccctggtg ccagtggcag aggagtgagc 

L> 
> 

Y F D V> 
35 h > 
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YFD V W G QGTT VTV SS> 
i VH PHARMACIA [SPLIT]___ i > 



>Xhol 
I 

11110 
attaactcga g 
taattgagct c 
D * L E> 

a > 



[SEQ. ID. NO.: 13] - Fiber Construct A7 C242 

15 

Sequence Range: 1 to 1402 

>EcoRl 
I 

I 10 20 30 40 50 
gaattcgatg aagcgcgcaa gaccgtctga agataccttc aaccccgtgt 
cttaagctac ttcgcgcgtt ctggcagact tctatggaag ttggggcaca 
M> 
> 

M KRA RPSE DTF NPV> 
v v TAIL v > 



60 70 80 90 100 

atccatatga cacggaaacc ggtcctccaa ctgtgccttt tcttactcct 
30 taggtatact gtgcctttgg ccaggaggtt gacacggaaa agaatgagga 

YPYD TET GPP TVPF LTP> 
v v TAIL v v > 



20 



25 



35 



110 120 130 140 150 

ccctttgtat cccccaatgg gtttcaagag agtccccctg gggtactctc 
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gggaaacata gggggttacc caaagttctc tcagggggac cccatgagag 

L S> 



P F V SPNG FQE SPP GV> 
5 v TAIL v v > 



160 170 180 190 200 

tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 

aaaccfcggat aggcttggag atcaatggag gttaccgtac gaacgcgagt 

10 L A L> 



LRL SEP LVTS N G M> 
t REPEAT It > 



15 210 220 230 240 250 

aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa 
tttacccgtt gccggagaga gacctgctcc ggccgttgga atggagggtt 

L T S Q> 
r > 

20 KMGN GLS LDE AGN> 

s REPEAT 2 s > 



260 270 280 290 300 

aatgtaacca ctgtgagccc acctctcaaa aaaaccaagt caaacataaa 
25 ttacattggt gacactcggg tggagagttt ttttggttca gtttgtattt 

I N> 
> 

NVT TVSP PLK KTK SN> 
r REPEAT 3 r r > 

30 

310 320 330 340 350 

cctggaaata tctgcacccc tcacagttac ctcagaagcc ctaactgtgg 
ggacctttat agacgtgggg agtgtcaatg gagtcttcgg gattgacacc 

L T V> 

35 > 
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LEI SAP LTVT S E A> 
q REPEAT 4 q > 



360 370 380 390 400 

ctgccgccgc acctctaatg gtcgcgggca acacactcac catgcaatca 
gacggcggcg tggagattac cagcgcccgt tgtgtgagtg gtacgttagt 

L T M Q S> 
REPEAT 6 > 

A A A A P L M VAG NT> 
p REPEAT 5 p > 



410 420 430 440 450 

caggccccgc taaccgtgca cgactccaaa cttagcattg ccacccaagg 

gtccggggcg attggcacgt gctgaggttt gaatcgtaac ggtgggttcc 

LSI A T Q G> 
REPEAT 7 > 

QAP LTVH DSK> 
REPEAT 6 o > 



460 470 480 4 90 500 

acccctcaca gtgtcagaag gaaagctagc ccctgacgta gcaagcttac 
tggggagtgt cacagtcttc ctttcgatcg gggactgcat cgttcgaatg 

L A> 
m > 

PLT VSE GK> 
REPEAT 7 n > 

P D V A S L> 
LUNG SURFACTA > 



510 520 530 540 • 550 
gacaacaggt agaagccttg caagggcagg tacaacactt acaggcggca 
ctgttgtcca tcttcggaac gttcccgtcc atgttgtgaa tgtccgccgt 
RQQV EAL QGQ VQHL QAA> 
LUNG SURFACTANT PROTEIN D TRIM w > 

560 ' - 570 580 590 600 
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tttagccaat acaaaaaggt agagttgttt ccaaacggag ccaagaagct 
aaatcggtta tgtttttcca tctcaacaaa ggtttgcctc ggttct tcga 

A K K L> 
_b > 

F S Q YKKV ELF PNG> 
LUNG SURFACTANT PROTEIN D TRIM > 

>Clal 
I 

610 620 630 | 640 650 

gaacgacgcc caggccccca agagcgaccc atcgatcgat attgtgatga 
cttgctgcgg gtccgggggt tctcgctggg tagctagcta taacactact 

N D A QAP KSD> 
SPA 4 8-60 b > 

P S I> 
c > 

D I V M> 
e > 

660 670 680 690 700 
ctcaggctgc accctctgta cctgtcactc ctggagagtc agtatccatc 
gagtccgacg tgggagacat ggacagtgag gacctctcag tcataggtag 
TQAA PSV PVT P G E S VSI> 
e VK PHARMACIA e > 



710 720 730 740 750 

tcctgcaggt ctagtaagag tctcctgcat agtaatggca acacttactt 
aggacgtcca gatcattctc agaggacgta tcattaccgt tgtgaatgaa 
SCR SSKS LLH SNG NTYL> 

e VK PHARMACIA e > 

S LLH SNG NTYL> 
g g CDR1 g > 



760 770 780 790 800 

gtattggttc ctgcagaggc caggccagtc tcctcagctc ctgatatatc 
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cataaccaag gacgtctccg gtccggtcag aggagtcgag gactatatag 
YWF LQR PGQS PQL LIY> 

e VK PHARMACIA e > 

_> 

Y> 

> 



810 820 830 840 850 

ggatgtccaa ccttgtctca ggagtcccag acaggttcag tggcagtggg 
10 cctacaggtt ggaacagagt cctcagggtc tgtccaagtc accgtcaccc 

R M S N LVS GVP DRFS GSG> 

e VK PHARMACIA e > 

R M S N L V S> 
CDR2 > 

15 

860 870 880 890 900 
tcaggaactg ctttcacact gagaatcagt agagtggagg ctgaggatgt 
agtccttgac gaaagtgtga ctcttagtca tctcacctcc gactcctaca 
SGT AFTL RIS RVE A E D V> 
2 0 e VK PHARMACIA e- > 

910 920 930 940 950 

gggtgtttat tactgtctgc aacatctaga gtatccgttc acgttcggtc 
cccacaaata atgacagacg ttgtagatct cataggcaag tgcaagccag 
25 L QHLE YPF T> 

d CDR3 d > 

GVY YCL QHLE YPF TFG> 

e VK PHARMACIA e > 



30 960 970 980 990 1000 

ctgggaccaa gctggagctg aaacggcccc cggactttgt tcccccggcc 
gaccctggtt cgacctcgac tttgccgggg gcctgaaaca agggggccgg 
PGTK LEL KR> 
VK PHARMACIA e > 

35 ' P P D F V P P A> 
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1 825 1 > 



1010 1020 1030 1040 1050 

gctagtttcc ctgatcactc ccctcgtggc caggtccagt tggtgcagtc 
5 cgatcaaagg gactagtgag gggagcaccg gtccaggtca accacgtcag 

Q V Q L V Q S> 
VH PHARMACIA [ > 

ASF PD HS PRO 
1 8 2 5 1 > 

10 

1060 1070 1080 1090 1100 
tggacctgag ctgaagaagc ctggagagac agtcaagatc tcctgcaagg 
■ acctggactc gacttcttcg gacctctctg tcagttctag aggacgttcc 
G P E L K K PGET VKI SCK> 
15 i VH PHARMACIA [SPLIT] i > 



1110 1120 1130 1140 1150 

cttctgatta taccttcaca tactatggaa tgaactgggt gaagcaggct 
gaagactaat atggaagtgt atgatacctt acttgaccca cttcgtccga 
20 A S D Y TFT YYG M N W V ' K Q A> 

i VH PHARMACIA [SPLIT] i > 

YYG M N> 
CDR1 k > 



25 1160 1170 1180 1190 1200 

ccgggaaagg gtttaaagtg gatgggctgg atagacacca ccactggaga 
ggccctttcc caaatttcac ctacccgacc tatctgtggt ggtgacctct 
PGK GLKW MGW IDT TTGE> 

i VH PHARMACIA [SPLIT] i > 

30 W IDT TTGE> 

j_CDR2 [SPLIT] > 



35 



1210 1220 1230 1240 1250 

gccaacatat gctgaagatt ttaagggacg gattgccttc tctttggaga 
cggttgtata cgacttctaa aattccctgc ctaacggaag agaaacctct 



PCT/GBO 1/03252 



128 



PTY AED FKGR I A F S L E> 
i VH PHARMACIA [SPLIT] i > 

PTY AED FKG> 
CDR2 [SPLIT]_j > 

1260 1270 1280 1290 1300 
cctctgccag cactgcctat ttgcagatca aaaacctcaa aaatgaggac 
ggagacggtc gtgacggata aacgtctagt ttttggagtt tttactcctg 
TSAS TAY L Q I KNLK NED> 
i VH PHARMACIA [SPLIT] i , > 



1310 1320 1330 1340 1350 

acggctacat atttctgtgc aagacggggg ccttacaact ggtactttga 
tgccgatgta taaagacacg ttctgccccc ggaatgttga ccatgaaact 

R G P Y N W Y F D> 

h CDR3 h > 

TAT YFCA RRG PYN W Y F D> 
i VH PHARMACIA [SPLIT] i > 



>Xhol 
I 

1360 1370 1380 1390 1400 

tgtctggggc caagggacca cggtcaccgt ctcctcactc gattaactcg 
acagaccccg gttccctggt gccagtggca gaggagtgag ctaattgagc 

L D * L> 
a > 

V> 
> 

VWG QGT TVTV SS> 
VH PHARMACIA [SPLIT] i > 



ag 
tc 
E> 
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[SEQ. ID. NO. : 14] - 



5 Sequence Range: 1 to 174 - Affibody ZIgG 

10 20 30 40 50 

gtagacaaca aattcaacaa agaacaacaa aacgcgttct atgagatctt 
catctgttgt ttaagttgtt tcttgttgtt ttgcgcaaga tactctagaa 
10 VDN KFNK EQQ NAF YEIL> 

a a_Z-DOM?N a a > 

60 70 80 90 100 

acatttacct aacttaaacg aagaacaacg aaacgccttc atccaaagtt 
15 tgtaaatgga ttgaatttgc ttcttgttgc tttgcggaag taggtttcaa 

HLP NLN EEQR NAF IQS> 
a a 2 - DOM?N a a > 

110 120 130 140 150 

2 0 taaaagatga cccaagccaa agcgctaact tgctagcaga agctaaaaag 

attttctact gggttcggtt tcgcgattga acgatcgtct tcgatttttc 
LKDD.PSQ SAN LLAE A K K> 
a a _Z~ DOM?N a a > 

25 160 170 

ctaaatgatg ctcaggcgcc gaaa 
gatttactac gagtccgcgg cttt 
LND AQAP K> 
Z-DOM7N a > 

30 

[SEQ. ID. NO.: 15] - Affibody ZIgA 



35 Sequence Range: 1 to 174 

RECTIFIED SHEET (RULE 91) 
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10 20 30 40 50 

gtagacaaca aattcaacaa agaaacaata caagcgagtc aagagatcag 
catctgttgt ttaagttgtt tctttgttat gttcgctcag ttctctagtc 
VDN K F N K ETI Q A S QEIR> 
5 a AFFI IGA a a > 



60 70 80 90 100 

actattacct aacttaaacg gtagacaaaa gcttgccttc atccacagtt 
tgataatgga ttgaatttgc catctgtttt cgaacggaag taggtgtcaa 
10 LLP NLN GRQK LAF IHS> 

a AFFI IGA a a > 



110 120 130 140 150 
tacttgatga cccaagccaa agcgctaact tgctagcaga agctaaaaag 
atgaactact gggttcggtt tcgcgattga acgatcgtct tcgatttttc 
LLDD PSQ SAN LLAE A K K> 
a AFFI IGA a a > 



160 170 
20 ctaaatgatg ctcaggcgcc gaaa 

gatttactac gagtccgcgg cttt 
LND AQAP K> 
AFFI IGA a > 



25 

[SEQ. ID. NO. : 16] - Recombinant Fiber A7 ZIgG 



Sequence Range: 1 to 820 

30 

>EcoRl 
I 

I 10 20 30 40 50 

gaattcgatg aagcgcgcaa gaccgtctga agataccttc aaccccgtgt 
35 cttaagctac ttcgcgcgtt ctggcagact tctatggaag ttggggcaca 
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M> 

> 

M KRA RPSE DTF N P V> 
j j TAIL j > 

5 

60 70 80 90 100 
atccatatga cacggaaacc ggtcctccaa ctgtgccttt tcttactcct 
taggtatact gtgcctttgg ccaggaggtt gacacggaaa agaatgagga 
Y P..' Y D TET GPP T V P F LTP> 
1 0 j j _T AI L j j > 

110 120 130 140 150 

ccctttgtat cccccaatgg gtttcaagag agtccccctg gggtactctc 
gggaaacata gggggttacc caaagttctc tcagggggac cccatgagag 
15 L S> 

> 

PFV SPNG FQE SPP GV> 
j TA I L j j > 



20 160 170 180 190 200 

tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 
aaacgcggat aggcttggag atcaatggag gttaccgtac gaacgcgagt 

L A L> 
> 

25 LRL SEP LVTS NGM> 

h REPEAT 1 h > 

210 220 230 240 250 

aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa 
3 0 tttacccgtt gccggagaga gacctgctcc ggccgttgga atggagggtt 

L T S Q> 
f > 

KMGN GLS LDE A G N> 
g REPEAT 2 g > 

35 
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260 270 280 290 300 

aatgtaacca ctgtgagccc acctctcaaa aaaaccaagt caaacataaa 
ttacattggt gacactcggg tggagagttt ttttggttca gtttgtattt 

I N> 



NVT TVSP PLK KTK SN> 
f REPEAT 3 f f > 



310 320 330 340 350 

10 cctggaaata tctgcacccc tcacagttac ctcagaagcc ctaactgtgg 

ggacctttat agacgtgggg agtgtcaatg gagtcttcgg gattgacacc 

L T V> 



LEI SAP LTVT SEA> 
15 e REPEAT 4 e > 



360 370 380 390 400 

ctgccgccgc acctctaatg gtcgcgggca acacactcac catgcaatca 
gacggcggcg tggagattac cagcgcccgt tgtgtgagtg gtacgttagt 
20 L T M Q S> 

REPEAT 6 > 

A A A A PLM VAG NT> 
d REPEAT 5 d > 



25 410 420 430 440 450 

caggccccgc taaccgtgca cgactccaaa cttagcattg ccacccaagg 
gtccggggcg attggcacgt gctgaggttt gaatcgtaac ggtgggttcc 

L S I A T Q G> 
REPEAT 7 > 

30 QAP LTVH DSK> 

REPEAT 6 c > 



35 



460 470 480 490 500 

acccctcaca gtgtcagaag gaaagctagc ccctgacgta gcaagcttac 
tggggagtgt cacagtcttc ctttcgatcg gggactgcat cgttcgaatg 
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L A> 
a__> 

PLT VSE GK> 
REPEAT 7 b > 

P D V A S L> 
LUNG SURFACTA > 

510 520 530 540 550 
gacaa'caggt agaagccttg caagggcagg tacaacactt acaggcggca 
ctgttgtcca tcttcggaac gttcccgtcc atgttgtgaa tgtccgccgt 
RQQV EAL QGQ VQHL Q A A> 
LUNG SURFACTANT PROTEIN D TRIM k > 



560 570 580 590 600 

tttagccaat acaaaaaggt agagttgttt ccaaacggag ccaagaagct 
aaatcggtta tgtttttcca tctcaacaaa ggtttgcctc ggttcttcga 

FSQ YKKV ELF PNG> 
LUNG SURFACTANT PROTEIN D TRIM > 

A K K L> 



>Clal 
I 

610 620 630 | 640 650 

25 gaacgacgcc caggccccca agagcgaccc atcgatcgta gacaacaaat 

cttgctgcgg gtccgggggt tctcgctggg tagctagcat ctgttgttta 

NDA QAP KSD> 
SPA 48-60 1 > 

P S I> 

3 0 _ m > 

V D N K> 

o > 



35 



660 670 680 690 700 

tcaacaaaga acaacaaaac gcgttctatg agatcttaca tttacctaac 
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agttgtttct tgttgttttg cgcaagatac tctagaatgt aaatggattq 
F N K E QQN A F Y EILH LPN> 
o o Z-DOM7N o o > 



5 710 720 730 740 750 

ttaaacgaag aacaacgaaa cgccttcatc caaagtttaa aagatgaccc 
aatttgcttc ttgttgcttt gcggaagtag gtttcaaatt ttctactggg 

LNE EQRN A F I QSL KDDP> 
o o_Z-DOM?N o o > 

10 

760 770 780 790 800 

aagccaaagc gctaacttgc tagcagaagc taaaaagcta aatgatgctc 
ttcggtttcg cgattgaacg atcgtcttcg atttttcgat ttactacgag 
SQS A N L LAEA KKL NDA> 

1 5 o o_Z - DOM?N o o > 

>Xhol 
I 

810 I 820 

aggcgccgaa ataactcgag 
20 tccgcggctt tattgagctc 

*> 
> 

Q A P K> 
0_> 

25 



fSEQ. ID. NO. : 17] - Recombinant Fiber A7 ZIgA 

30 

Sequence Range: 1 to 820 

>EcoRl 
I 

35 I 10 20 30 40 50 
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gaattcgatg aagcgcgcaa gaccgtctga agataccttc aaccccgtgt 
cttaagctac ttcgcgcgtt ctggcagact tctatggaag ttggggcaca 
M> 

M KRA RPSE DTF NPV> 
j j TA I L j 

60 70 80 90 100 
atcca/tatga cacggaaacc ggtcctccaa ctgtgccttt tcttactcct 
taggtatact gtgccttfcgg ccaggaggtt gacacggaaa agaatgagga 
YPYD TET GPP TVPF L T P> 
j j TAIL j j : 

110 120 130 140 150 

ccctttgtat cccccaatgg qtttcaagag agtccccctg gggtactctc 
gggaaacata gggggttacc caaagttctc tcagggggac cccatgagag 

L S 



PFV SPNG FQE SPP GV> 
j TAIL j j > 

160 170 180 190 200 

tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 
aaacgcggat aggcttggag atcaatggag gttaccgtac gaacgcgagt 

L A L> 



LRL SEP LVTS NGM> 
h REPEAT 1 h ; . > 



210 220 230 240 250 

aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa 
tttacccgtt gccggagaga gacctgctcc ggccgttgga atggagggtt 

L T S Q> 
f 

KMGN GLS LDE AGN> 
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REPEAT 2 



260 270 280 290 300 

aatgtaacca ctgtgagccc acctctcaaa aaaaccaagt caaacataaa 
ttacattggt gacactcggg tggagagttt ttttggttca gtttgtattt 

I N> 
> 

NVT TVSP PLK KTK SN> 
f REPEAT 3 f f > 



10 

310 320 330 340 350 

cctggaaata tctgcacccc tcacagttac ctcagaagcc ctaactgtgg 
ggacctttat agacgtgggg agtgtcaatg gagtcttcgg gattgacacc 

L T V> 

15 > 

LEI SAP LTVT SEA> 
e REPEAT 4 e > 

360 370 380 390 400 

2 0 ctgccgccgc acctctaatg gtcgcgggca acacactcac catgcaatca 

gacggcggcg tggagattac cagcgcccgt tgtgtgagtg gtacgttagt 

L T M Q S> 

■ REPEAT 6 > 

A A A A PLM VAG NT> 
25 d REPEAT 5 d > 

410 420 430 440 450 

caggccccgc taaccgtgca cgactccaaa cttagcattg ccacccaagg 
gtccggggcg attggcacgt gctgaggttt gaatcgtaac ggtgggttcc 
30 LSI A T Q G> 

REPEAT 7 > 

QAP LTVH DSK> 
REPEAT 6 c > 

35 460 470 480 490 500 
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acccctcaca gtgtcagaag gaaagctagc ccctgacgta gcaagcttac 
tggggagtgt cacagtcttc ctttcgatcg gggactgcat cgttcgaatg 

L A> 
a__> 

5 P.LTVSEGK> 
REPEAT 7 b > 

P D V A S L> 
LUNG SURFACTA > 

10 510 520 530 540 550 

gacaacaggt agaagccttg caagggcagg tacaacactt acaggcggca 
ctgttgtcca tcttcggaac gttcccgtcc atgttgtgaa tgtccgccgt 
RQQV EAL QGQ VQHL Q A A> 
LUNG SURFACTANT PROTEIN D TRIM k > 

15 

560 570 580 590 600 
tttagccaat acaaaaaggt agagttgttt ccaaacggag ccaagaagct 
aaatcggtta tgtttttcca tctcaacaaa ggtttgcctc ggttcttcga 
FSQ YKKV ELF PNG> 
20 LUNG SURFACTANT PROTEIN D TRIM > 

A K K L> 
JL > 

>Clal 

25 | 

610 620 630 | 640 650 

gaacgacgcc caggccccca agagcgaccc atcgatcgta gacaacaaat 
cttgctgcgg gtccgggggt tctcgctggg tagctagcat ctgttgttta 
NDA QAP KSD> 
30 SPA 4 8-60___l > 

P S I> 
m > 

V D N K> 

O > 



35 
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660 670 680 690 700 
tcaacaaaga aacaatacaa gcgagtcaag agatcagact attacctaac 
agttgtttct ttgttatgtt cgctcaqttc tctagtctga taatggattg 
F N K E T I Q ASQ EIRL LPN> 
5 o AFFI IGA _o o > 

710 720 730 740 750 

ttaaacggta gacaaaagct tgccttcatc cacagtttac ttgatgaccc 
aatttgccat ctgttttcga acggaagtag gtgtcaaatg aactactggg 
10 LNGP. QKLAFIHSLLDDP> 

o AFFI IGA _o o > 

760 770 780 790 800 

aagccaaagc gctaacttgc tagcagaagc taaaaagcta aatgatgctc 
•15 ttcggtttcg cgattgaacg atcgtcttcg atttttcgat ttactacgag 

SQS ANL LAEA KKL NDA> 

o AFFI IGA o o > 



20 



30 



>Xhol 



810 | 820 

aggcgccgaa ataactcgag 
tccgcggctt tattgagctc 
*> 

25 > 

Q A P K> 

o > 



[SEQ. ID. NO. : 18] - Recombinant Fiber A7 ZIgG/ZIgA 



Sequence Range: 1 to 1003 

35 
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10 



>EcoRl 
I 

I 10 20 30 40 50 

gaattcgatg aagcgcgcaa gaccgtctga agataccttc aaccccgtgt 
cttaagctac ttcgcgcgtt ctggcagact tctatggaag ttggggcaca 
M> 

_> 

M KRA RPSE DTF NPV> 
' j j TAIL j > 



60 70 80 90 100 
atccatatga cacggaaacc ggtcctccaa ctgtgccttt tcttactcct 
taggtatact gtgcctttgg ccaggaggtt gacacggaaa agaatgagga 
YPYD TET GPP TVPF LTP> 
1 5 j j TAI L j j > 

110 120 130 140 150 

ccctttgtat cccccaatgg gtttcaagag agtccccctg gggtactctc 
gggaaacata gggggttacc caaagttctc tcagggggac cccatgagag 
20 L S> 

> 

PFV SPNG F Q E SPP GV> 
j TA I L j j > 



25 160 170 180 190 200 

tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 
aaacgcggat aggcttggag atcaatggag gttaccgtac gaacgcgagt 

L A L> 
> 

30 LRL SEP LVTS NGM> 

h REPEAT 1 h > 



35 



210 220 230 240 250 

aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa 
tttacccgtt gccggagaga gacctgctcc ggccgttgga atggagggtt 
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L T S Q> 
f > 

KMGN GLS LDE A G N> 
g REPEAT 2 g > 

5 

260 270 280 290 300 

aatgtaacca ctgtgagccc acctctcaaa aaaaccaagt caaacataaa 
ttacattggt gacactcggg tggagagttt ttttggttca gtttgtattt 

I N> 

10 > 

NVT TVSP PLK KTK SN> 
f REPEAT 3 f f > 



310 320 330 340 350 

15 cctggaaata tctgcacccc tcacagttac ctcagaagcc ctaactgtgg 

ggacctttat agacgtgggg agtgtcaatg gagtcttcgg gattgacacc 

L T V> 
> 

LEI SAP LTVT SEA> 
2 0 e REPEAT 4 e > 

360 r 370 380 390 400 

ctgccgccgc acctctaatg gtcgcgggca acacactcac catgcaatca 

gacggcggcg tggagattac cagcgcccgt tgtgtgagtg gtacgttagt ' 

25 L T M Q S> 

REPEAT 6 > 

AAAA PLM VAG NT> 
d REPEAT 5 _d > 

30 410 420 430 440 450 

caggccccgc taaccgtgca cgactccaaa cttagcattg ccacccaagg 
gtccggggcg attggcacgt gctgaggttt gaatcgtaac ggtgggttcc 

LSI A T Q G> 
REPEAT 7 > 

35 QAP LTVH DSK> 
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REPEAT 6c > 



460 470 480 490 500 

acccctcaca gtgtcagaag gaaagctagc ccctgacgta gcaagcttac 
tggggagtgt cacagtcttc ctttcgatcg gggactgcat cgttcgaatg 

L A> 
a_> 

PLT VSE GK> 
REPEAT 7 b > 

P D V A S L> 
LUNG SURFACTA > 

510 520 530 540 550 
gacaacaggt agaagccttg caagggcagg tacaacactt acaggcggca 
ctgttgtcca tcttcggaac gttcccgtcc atgttgtgaa tgtccgccgt 
RQQV EAL QGQ VQHL QAA> 
LUNG SURFACTANT PROTEIN D TRIM k > 



560 570 580 590 600 

tttagccaat acaaaaaggt agagttgttt ccaaacggag ccaagaagct 
aaatcggtta tgtttttcca tctcaacaaa ggtttgcctc .ggttcttcga 

FSQ YKKV ELF PNG> 
LUNG SURFACTANT PROTEIN D TRIM > 

A K K L> 
_1 > 

>Clal 
t 

610 620 630 j 640 650 

gaacgacgcc caggccccca agagcgaccc atcgatcgta gacaacaaat 
cttgctgcgg gtccgggggt tctcgctggg tagctagcat ctgttgttta 

NDA QAP KSD> 
SPA 4 8-60 1 > 

P S I> 
m > 



r 



WO 02/08263 



PCT/GB01/03252 



- - 142 - - 



V D N K> 
o > 

660 670 680 690 700 
5 tcaacaaaga acaacaaaac gcgttctatg agatcttaca tttacctaac 
agttgtttct tgttgttttg cgcaagatac tctagaatgt aaatggattg 
FNKE QQN AFY EILH LPN> 
o o Z-DOM7N o o > 



10 710 720 730 740 750 

ttaaacgaag aacaacgaaa cgccttcatc caaagtttaa aagatgaccc 
aatttgcttc ttgttgcttt gcggaagtag gtttcaaatt ttctactggg 

LNE EQRN A F I QSL KDDP> 
o o_Z - DOM?N o o > 

15 

760 770 780 790 800 
aagccaaagc gctaacttgc tagcagaagc taaaaagcta aatgatgctc 
ttcggtttcg cgattgaacg atcgtcttcg atttttcgat ttactacgag 
SQS A N L L A E A KKL NDA> 
2 0 o o_Z-DOM?N o _o > 

810 820 830 840 850 

aggcgccgaa aagctcgacc gtagacaaca aattcaacaa agaaacaata 
tccgcggctt ttcgagctgg catctgttgt ttaagttgtt tctttgttat 
2 5 Q A P K> 

o_> 

S S T> 
> 

VDN KFNK ETI> 
30 AFFI IGA _q > 

860 870 880 890 900 

caagcgagtc aagagatcag actattacct aacttaaacg gtagacaaaa 

gttcgctcag ttctctagtc tgataatgga ttgaatttgc catctgtttt 

35 Q A S QEIR LLP NLN GRQK> 
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_AFFI IGA _q_ 



910 920 930 940 950 

gcttgccttc atccacagtt tacttgatga cccaagccaa agcgctaact 
5 cgaacggaag taggtgtcaa atgaactact gggttcggtt tcgcgattga 

LAF I H S LLDD PSQ SAN> 
q AFFI IGA _q q > 

>Xhol 

10 I 

960 970 980 990 1000 

tgctagcaga agctaaaaag ctaaatgatg ctcaggcgcc gaaataactc 
acgatcgtct tcgatttttc gatttactac gagtccgcgg ctttattgag 

*> 

15 > 

LLAE AKK LND AQAP K> 
q AFFI IGA q q > 

gag 

20 etc 



[SEQ. ID. NO.: 19] - Recombinant Fiber A7 ZIgG/ZIgG 

25 

Sequence Range: 1 to 1003 

>EcoRl 
30 | 

| 10 20 30 40 50 

gaattcgatg aagcgcgcaa gaeegtctga agataccttc aaccccgtgt 
cttaagctac ttcgcgcgtt ctggcagact tctatggaag ttggggcaca 
M> 
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M KRA RPSE DTF NPV> 
j j TAIL j > 

60 70 80 90 100 
5 atccatatga cacggaaacc ggtcctccaa ctgtgccttt tcttactcct 
taggtatact gtgcctttgg ccaggaggtt gacacggaaa agaatgagga 
YPYD TET GPP TVPF LTP> 
j j TAIL j j > 

10 110 120 130 140 150 

ccctttgtat cccccaatgg gtttcaagag agtccccctg gggtactctc 
gggaaacata gggggttacc caaagttctc tcagggggac cccatgagag 

L S> 



15 PFVSPNGFQESPPGV> 

j T A I L j j > 



160 170 180 190 200 

tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 
2 0 aaacgcggat aggcttggag atcaatggag gttaccgtac gaacgcgagt 

L A L> 



LRL SEP LVTS NGM> 
h REPEAT 1 h > 

25 

210 220 230 240 250 

aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa 
tttacccgtt gccggagaga gacctgctcc ggccgttgga atggagggtt 

L T S Q> 

3 0 _f > 

KMGN GLS LDE A G N> 
g REPEAT 2 g_ > 



35 



260 270 280 290 300 

aatgtaacca ctgtgagccc acctctcaaa aaaaccaagt caaacataaa 
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ttacattggt gacactcggg tggagagttt ttttggttca gtttgtattt 

I N> 
> 

NVT TVSP PLK KTK SN> 
f REPEAT 3 f f > 



310 320 330 340 350 

cctggaaata tctgcacccc tcacagttac ctcagaagcc ctaactgtgg 
ggac£tttat agacgtgggq agtgtcaatg gagtcttcgg gattgacacc 
10 L T V> 

;> 

LEI SAP LTVT SEA> 
e REPEAT 4 e > 

15 360 370 380 390 400 

ctgccgccgc acctctaatg gtcgcgggca acacactcac catgcaatca 

gacggcggcg tggagattac cagcgcccgt tgtgtgaqtg gtacgttagt 

L T M Q S> 
REPEAT 6 > 

20 AAAA PLM VAG NT> 

d REPEAT 5 d > 

410 420 430 440 450 

caggccccgc taaccgtgca cgactccaaa cttagcattg ccacccaagg 
2 5 gtccggggcg attggcacgt gctgaggttt gaatcgtaac ggtgggttcc 

LSI A T Q G> 
REPEAT 7 > 

QAP LTVH DSK> 
REPEAT 6_c > 

30 

460 470 480 490 500 

acccctcaca gtgtcagaag gaaagctagc ccctgacgta gcaagcttac 
tggggagtgt cacagtcttc ctttcgatcg gggactgcat cgttcgaatg 

L A> 

35 a > 
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P L T VSE GK> 
REPEAT 7 b > 

P D V A S L> 
LUNG SURFACTA > 

5 

510 520 530 540 550 
gacaacaggt agaagccttg caagggcagg tacaacactt acaggcggca 
ctgttgtcca tcttcggaac gttcccgtcc atgttgtgaa tgtccgccgt 
RQQ.V E A L QGQ VQHL QAA> 
10 LUNG SURFACTANT PROTEIN D TRIM k > 

560 570 580 590 600 

tttagccaat acaaaaaggt agagttgttt ccaaacggag ccaagaagct 
aaatcggtta tgtttttcca tctcaacaaa ggtttgcctc ggttcttcga 
15 FSQ YKKV ELF PNG> 

LUNG SURFACTANT PROTEIN D TRIM > 

A K K L> 
1 > 



20 >Clal 

I 

610 620 630 I 640 650 

gaacgacgcc caggccccca agagcgaccc atcgatcgta gacaacaaat 
cttgctgcgg gtccgggggt tctcgctggg tagctagcat ctgttgttta 
25 NDA QAP KSD> 

SPA 48-60 1 > 

P S I> 
m > 



30 



V D N K> 
o > 



660 670 680 690 700 

tcaacaaaga acaacaaaac gcgttctatg agatcttaca tttacctaac 

agttgtttct tgttgttttg cgcaagatac tctagaatgt aaatggattg 

35 FNKE QQN AFY EILH LPN> 
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o o Z-DOM7N o ' o > 



710 720 730 740 750 

ttaaacgaag aacaacgaaa cgccttcatc caaagtttaa aagatgaccc 
. 5 aatttgcttc ttgttgcttt gcggaagtag gtttcaaatt ttctactggg 

L N E EQRN A F I QSL KDDP> 
o o Z-DOM7N o o > 



.' 760 770 780 790 800 

10 aagccaaagc gctaacttgc tagcagaagc taaaaagcta aatgatgctc 

ttcggtttcg cgattgaacg atcgtcttcg atttttcgat ttactacgag 
SQS ANL LAEA KKL NDA> 

o o Z-DOM7N o o > 



15 810 820 830 840 850 

aggcgccgaa aagctcgacc gtagacaaca aattcaacaa agaacaacaa 
tccgcggctt ttcgagctgg catctgttgt ttaagttgtt tcttgttgtt 
Q A P K> 
o_> 

20 -■ VDN KFNK E Q Q> 

p_Z-DOM?N p > 

S S T> 

> 



25 860 870 880 890 900 

aacgcgttct atgagatctt acatttacct aacttaaacg aagaacaacg 

ttgcgcaaga tactctagaa tgtaaatgga ttgaatttgc ttcttgttgc 

NAF YEIL HLP NLN EEQR> 
p p_Z-DOM?N p p_ > 

30 

910 920 930 940 950 

aaacgccttc atccaaagtt taaaagatga cccaagccaa agcgctaact 

tttgcggaag taggtttcaa attttctact gggttcggtt tcgcgattga 

NAF IQS LKDD PSQ SAN> 
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>Xhol 
I 

960 970 980 990 1000 

tgctagcaga agctaaaaag ctaaatgatq ctcaggcgcc gaaataactc 
acgatcgtct tcgatttttc gatttactac gagtccgcgg ctttattgag 

*> 
> 

L L A E AKK LND AQAP K> 
p Z- DOM?N p p > 

gag 
etc 



SEQ I.D. No.: 47 - Anti 3 -galactosidase single chain Fv 
fragment 

ATGGCCGAGGTGCAGCTGGTGGAGTCTGGGGGAAGCCTGGTCAAGCCTGGGGGG 

TCCCTGAGACTCTCCTGTGCAGCCTCTGGATTCACCTTCAGTAACTATAGCATGAAC 

TGGGTCCGCCAGGCTCCAGGGAAGGGGCTGGAGTGGATCTCATCCATTAGTGGTAG 

TAGTAGATACATATACTACGCAGACTTCGTGAAGGGCCGATTCACCATCTCCAGAGAC 

AACGCCACGAACTCACTGTACCTGCAAATGAACAGCCTGAGAGCCGAGGACACGGCTG 

TTTATTACTGTGTGAGATCCAGTATTACGATTTTTGGTGGCGGTATGGACGTCTGG 

GGCAGAGGCACCCTGGTCACCGTCTCCTCAGGTGGAGGCGGTTCAGGCGGAGGTGG 

CAGCGGCGGTGGCGGATCGCAGTCTGTGCTGACTCAGCCTGCCTCCGTGTCTGGGT 

CTCCTGGACAGTCGATCACCATCTCCTGCGCTGGAACCAGCAGTGACGTTGGTGGT 

TATAACTATGTCTCCTGGTACCAACAACACCCAGGCAAAGCCCCCAAACTCATGAT 

TTATGAGGACAGTAAGCGGCCCTCAGGGGTTTCTAATCGCTTCTCTGGCTCCAAGT 

CTGGCAACACGGCCTCCCTGACAATCTCTGGGCTCCAGGCTGAGGACGAGGCTGAT 

TATTACTGCAGCTCATATACAACCAGGAGCACTCGAGTTTTCGGCGGAGGGACCAA 

GCTGGCCGTCCTAGGTGCGGCCGCAGAACAAAAACTCATCTCAGAAGAGGATCTGA 

ATGGGGCCGCACATCACCATCATCACCAT 
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Claims : 

1. A modified virus comprising one or more non-native 
polypeptides, which polypeptide comprises one or more 
5 framework moieties each containing one or more binding 
moieties, which polypeptide is capable of being expressed 
in the cytoplasm and nucleus of a mammalian host cell in 
a conformation which is maintained in the absence of a 
ligand for s'aid binding moieties, said conformation 
10 allowing said binding moieties subsequently to bind with 
said ligand, and which polypeptide is capable of transport 
though the nuclear membrane, wherein said modified virus 
has an altered tropism conferred by said binding moieties. 

15 2. The modified virus of claim 1 which is derived from 
a virus selected from the group consisting of 
adenoviruses, retroviruses , lentiviruses, adeno-associated 
viruses, Reoviridae, Picornaviridae, Parvoviridae , 
Papovaviridae and Caliciviridae . 

20 

3. The modified virus of claim 1 or claim 2 which is 
derived from human adenovirus 

4. The modified virus of any one of claims 1 to 3 which 
25 is derived from human adenovirus serotype 5. 

5. The modified virus of any one of claims 1 to 4 
wherein said non-native polypeptide replaces, is 
incorporated into, or forms a fusion protein with, a viral 

30 protein component of the wild type virus. 

6. The modified virus of any one of claims 1 to 5 
wherein said non-native polypeptide and/or the viral 
component protein comprising the non-native polypeptide is 

35 soluble in a cellular environment. 
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7. The modified virus of claim 6 wherein greater than 
30% of the non-native polypeptide or the viral component 
protein comprising the non-native polypeptide is present 
in the soluble fraction of cell lysates of cells 

5 expressing the non-native polypeptide or viral component 
protein comprising the non-native polypeptide. 

8. The modified virus of any one of claims 5 to 7 
wherein said viral protein component is an adenoviral 

10 fiber protein. 

9. The modified virus of claim 8 wherein said non-native 
polypeptide is incorporated into an adenoviral fiber 
protein such that the wild-type fiber knob or cell binding 

15 domain thereof is removed. 

10. The modified virus of any one of claims 1 to 9 
wherein said non-native polypeptide comprises one or more 
further elements that mimic the native structure or 

20 function of a viral component protein in which said 
non-native polypeptide is incorporated or which said 
non-native polypeptide replaces. 

11. The modified virus of any one of claims 1 to 10 
25 wherein said non-native polypeptide does not contain any 

di- sulphide bonds. 

12. The modified virus of any one of claims 1 to 11 
wherein the non-native polypeptide comprises one or more 

30 a-helical structures. 

13. The modified virus of any one of claims 1 to 12 
wherein said non-native polypeptide comprises a framework 
moiety derived from an antibody which is capable of 

35 productive folding in the cytoplasm and subsequent 
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transport into the cell nucleus. 

14. The modified virus of claim 13 wherein said framework 
moiety comprises one or more binding moieties derived from 

5 a CDR of a further antibody. 

15. The modified virus of any one of claims 1 to 12 
wherein said non-native polypeptide is or comprises a 
combinatorial ..'protein or an affibody. 

10 

16. The modified virus of any one of claims 1 to 15 
wherein said non-native polypeptide comprises one or more 
framework moieties derived from a bacterial receptor. 

15 17. The modified virus of any one of claims 1 to 12 or 16 
wherein said non-native polypeptide comprises one or more 
binding moieties which are present within one or more 
loops of a helical bundle and/or one or more of the loops 
connecting these bundles. 

20 

18. The modified virus of claim 16 wherein said 
non-native polypeptide comprises one or more framework 
moieties derived from the immunoglobulin binding Z-domain 
from staphylococcal protein A or an immunoglobulin binding 

2 5 domain from streptococcal protein G. 

19. The modified virus of any one of claims 1 to 18 
wherein said non-native polypeptide comprises one or more 
framework moieties which comprise a non native 

30 trimerisation motif. 

20. The modified virus of claim 19 wherein said 
non-native polypeptide comprises one or more framework 
moieties which comprise the neck region peptide from human 

35 lung surfactant protein D. 
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21. The modified virus of any one of claims 1 to 20 which 
comprises two or more different non-native polypeptides. 

5 22. The modified virus of claim 21 which comprises a 
first non-native polypeptide which binds a target cell and 
a second non-native polypeptide which binds a production 
cell or permissive cell. 

10 23. The modified virus of any one of claims 1 to 22 
wherein said non-native polypeptide comprises a cleavage 
site positioned in a location that enables a binding 
moiety of the non-native polypeptide to be cleaved from 
the modified virus. 

15 

24. The modified virus of any one of claims 1 to 23 
wherein said virus comprises a modified viral component 
comprising said non-native polypeptide and a corresponding 
unmodified viral component e.g. a wild-type fiber and a 

20 modified fiber. 

25. The modified virus of any one of claims 1 to 24 
wherein said non-native polypeptide comprises a binding 
moiety capable of binding to a cell specific ligand which 

25 may optionally be Prostate Specific Membrane Antigen, EGF 
receptor, Her-2/Neu, VEGF receptor, CD22, gpl20, 
MHC/peptide complexes or membrane structures or surface 
molecules expressed or present on proliferating cells, 
tumor cells or virus infected cells. 

30 

26. The modified virus of any one of claims 1 to 25 which 
further comprises a site for insertion of one or more 
desired therapeutic genes or nucleic acid molecules. 

35 27. The modified virus of claim 26 which comprises 
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transgenes encoding cytosine deaminase and/or uracil 
phosphoribosyl transferase either as separate genes or 
together as a bifunctional fusion gene. 

28. The modified virus of any one of claims 1 to 27 which 
further comprises a viral component which is replaced with 
an equivalent component from a different serotype or is 
modified such that binding of said virus by antibodies 
pre-formed to /the wild type component is reduced. 

29. The modified virus of claim 28 v/herein said viral 
component is a hexon protein. 

30. A modified viral protein comprising a non-native 
15 polypeptide as defined in any one of claims 1 to 25. 

31. A cell containing a modified virus or viral protein 
as defined in any of the claims 1 to 30. 

20 32. A permissive cell for a modified virus as defined in 
any one of claims 1 to 2 9 which is capable of being 
cultured to propagate said modified virus. 

33. A method for producing a modified virus as defined in 
25 any one of claims 1 to 2 9 in cell culture, comprising the 
steps of: 

i) genetically modifying a virus to produce a modified 
virus comprising one or more non-native polypeptides, 
which polypeptide comprises one or more framework moieties 

3 0 each containing one or more- binding moieties, which 
polypeptide is capable of being expressed in the cytoplasm 
and nucleus of a mammalian host cell in a conformation 
which is maintained in the absence of a ligand for said 
binding moieties, said conformation allowing said binding 

35 moieties subsequently to bind with said ligand, and which 
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polypeptide is capable of transport though the nuclear 
membrane, wherein said modified virus has an altered 
tropism conferred by said binding moieties; 
ii) infecting permissive cells with said modified virus; 
5 iii) culturing said cells to produce the virus; 
and 

iv) harvesting, and optionally, purifying the modified 
virus produced. 

10 34. A method of regulating the replication of a modified 
virus as defined in any one of claims 1 to 2 9 comprising 
the steps of: 

i) constructing a modified virus such that a cleavage 
site is positioned between a binding moiety required for 

15 cell infection and the remainder of the recombinant viral 
component of which the binding moiety forms part; and, 

ii) bringing said recombinant virus into contact with a 
cleavage agent or cleavage means capable of cleaving said 
binding moiety from said viral component and thereby 

2 0 preventing the recombinant virus from undergoing further 

infection cycles. 

35. A method of determining the suitability of a non- 
native polypeptide for use in the preparation of a viral 

25 vector by determining its solubility in a cell system. 

36. A method as claimed in claim 35 which comprises the 
steps of: 

i) expressing said non-native polypeptide or a viral 

3 0 component protein comprising said non-native polypeptide 

in permissive cells; 

ii) subjecting the cells to lysis to produce a cell 
lysate; 

iii) separating the soluble and insoluble fractions of the 
35 cell lysate; 
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iv) analysing the soluble and insoluble fractions of the 
cell lysate for the content of said non-native polypeptide 
or viral component protein comprising said non-native 
polypeptide / 

- 5 and, 

v) comparing the relative content of said non-native 
polypeptide or viral component protein comprising said 
non-native polypeptide in the soluble and insoluble 
fractions 

10 

37. A method of assaying the solubility of a non-native 
polypeptide as defined in any one of claims 1 to 25 or a 
modified viral protein as defined in claim 30, which 
method comprises the steps of: 

15 i) expressing said non-native polypeptide or a viral 
component protein comprising said non-native polypeptide 
in permissive cells; 

ii) subjecting the cells to lysis to produce a cell 
lysate / 

20 iii) separating the soluble and insoluble fractions of the 
cell lysate; 

iv) analysing the soluble and insoluble fractions of the 
cell lysate for the content of said non-native polypeptide 
or viral component protein comprising said non-native 

25 polypeptide; 
and, 

v) comparing the relative content of said non-native 
polypeptide or viral component protein comprising said 
non-native polypeptide in the soluble and insoluble 

3 0 fractions. 

38. The modified virus of any one of claims 1 to 29 for 
use in therapy. 



35 



39. Use of the modified virus of any one of claims 1 to 
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2 9 in the preparation of a medicament for the treatment of 
tumour cells or proliferating cells. 



5 



40. A pharmaceutical composition comprising a modified 
virus of any one of claims 1 to 2 9 and a pharmaceutically 
acceptable carrier or excipient . 
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